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POLYNUCLEOTIDE ENCODING INSECT 
ECDYSONE RECEPTOR 


This application is a continuation of application Ser. No. 
07/954,937, filed Sep. 30, 1992, now issued as U.S. Pat. No. 
5,514,578, which is a continuation of application Ser. No. 
07/485,749, filed Feb. 26, 1990, now abandoned. 

This invention was made in part with government sup- 
port under Grant DCB 8405370 from the National Science 
Foundation. The government may have certain rights in this 
invention. 


FIELD OF THE INVENTION 


This invention relates generally to the use of recombinant 
DNA methods as applied to the nucleic acid sequences and 
polypeptides characteristic of insect steroid receptor super- 
family members and, more particularly, to uses of such 
receptors and the DNA regulatory elements associated with 
genes whose expression they regulate for the production of 
proteins in cultured cells and, and to uses of such hormone 
receptor proteins and genes in identifying new hormones 
that control insect development. 


BACKGROUND OF THE INVENTION 


The temporal sequence of gene expression determines the 
nature and sequence of steps in the development of the adult 
animal from the fertilized egg. The common fruit fly, Droso- 
phila melanogaster, provides a favorable model system for 
studying this genetic control of development. Various 
aspects of Drosophila development are representative of 
general insect and, in many respects, vertebrate develop- 
ment. 

The steroid hormone 20-OH ecdysone, also known as 
B-ecdysone, controls timing of development in many 
insects. See. generally, Koolman (ed.), Ecdysone: From 
Chemistry to Mode of Action, Thieme Medical Pub., N.Y. 
(1989), which is hereby incorporated herein by reference. 
The generic term “ecdysone” is frequently used as an 
abbreviation for 20-OH ecdysone. Pulses, or rises and falls, 
of the ecdysone concentration over a short period of time in 
insect development are observed at various stages of Droso- 
phila development. 

These stages include embryogenesis, three larval stages 
and two pupal stages. The last pupal stage ends with the 
formation of the adult fly. One studied effect of ecdysone on 
development is that resulting from a pulse at the end of the 
third, or last, larval stage. This pulse triggers the beginning 
of the metamorphosis of the larva to the adult fly. Certain 
tissues, called imaginal tissues, are induced to begin their 
formation of adult structures such as eyes, wings and legs. 

During the larval stages of development, giant polytene 
chromosomes develop in the non-imaginal larval tissues. 
These cable-like chromosomes consist of aggregates com- 
prising up to about 2,000 chromosomal copies. These chro- 
mosome aggregates are extremely useful because they pro- 
vide the means whereby the position of a given gene within 
à chromosome can be determined to a very high degree of 
resolution, several orders of magnitude higher than is typi- 
cally possible for normal chromosomes. 

A "puff" in the polytene chromosomes is a localized 
expansion or swelling of these cable-like polytene chromo- 
some aggregates that is associated with the transcription of 
a gene at the puff locus. A puff is, therefore, an indicator of 
the transcription of a gene located at a particular position in 
the chromosome. 

A genetic regulatory model was proposed to explain the 
temporal sequence of polytene puffs induced by the ecdys- 
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one pulse which triggers the larval-to-adult metamorphosis. 
See, Ashburner et al., *On the Temporal Control of Puffing 
Activity in Polytene Chromosomes," Cold Spring Harbor 
Symp. Quant. Biol. 38:655-662 (1974). This model pro- 
posed that ecdysone interacts reversibly with a receptor 
protein, the ecdysone receptor, to form an ecdysone-receptor 
complex. This complex would directly induce the transcrip- 
tion of a small set of *early" genes responsible for a half 
dozen immediately induced “early” puffs. These early genes 
are postulated to encode regulatory proteins that induce the 
transcription of a second set of “late” genes responsible for 
the formation of the “late” puffs that appear after the early 
puffs. The model thus defines a genetic regulatory hierarchy 
of three ranks, where the ecdysone-receptor gene is in the 
first rank, the early genes in the second rank and the late 
genes in the third. While this model derived form the puffing 
pattern observed in a non-imaginal tissue, similar genetic 
regulatory hierarchies may also determine the metamorphic 
changes in development of the imaginal tissues that are also 
targets of ecdysone, as well as the changes in tissue devel- 
opment induced by the pulses of ecdysone that occur at other 
developmental stages. 

Various structural data have been derived from vertebrate 
steroid and other lipophilic receptor proteins. A *superfam- 
ily" of such receptors has been defined on the basis of their 
structural similarities. See, Evans, *The Steroid and Thyroid 
Hormone Receptor Superfamily,” Science 240:889-895 
(1988); Green and Chambon, *Nuclear Receptors Enhance 
Our Understanding of Transcription Regulation," Trends in 
Genetics 4:309—314 (1988), both of which are hereby incor- 
porated herein by reference. Where their functions have 
been defined, these receptors, complexed with their respec- 
tive hormones, regulate the transcription of their primary 
target genes, as proposed for the ecdysone receptor in the 
above model. 

Cultivated agriculture has greatly increased efficiency of 
food production in the world. However, various insect pests 
have found it advantageous to seek out and exploit culti- 
vated sources of food to their own advantage. These insect 
pests typically develop by a temporal sequence of events 
which are characteristic of their order. Many, including 
Drosophila, initially develop in a caterpillar or maggot-like 
larval form. Thereafter, they undergo a significant metamor- 
phosis from which an adult emerges having characteristic 
anatomical features. Anatomic similarity is a reflection of 
developmental, physiological and biochemical similarities 
shared by these creatures. In particular, the principles of the 
insect ecdysteroid-hormone receptors and development, as 
described by Ashburner above, likely would be shared by 
many different types of insects. 

As one weapon against the destruction of cultivated crops 
by insects, organic molecules with pesticidal properties are 
used commonly in attempts to eliminate the insect popula- 
tions. However, the ecological side effects of these 
pesticides, due in part to their broad activity and lack of 
specificity, and in part, to the fact that some of these 
pesticides are not easily biodegradable, significantly affect 
populations of both insect and other species of animals. 
Some of these organisms may be advantageous from an 
ecological or other perspective. Furthermore, as the insect 
populations evolve in directions to minimize the effects of 
the applied pesticides, the amounts of pesticides applied are 
often elevated so high as to cause significant effects on other 
animals, including humans, which are affected directly or 
indirectly by the application of the pesticides. Thus, an 
important need exists for both highly specific pesticides or 
highly active pesticides which have biological effects only 
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on the species of animals targeted by the pesticides, and are 
biodegradable. Novel insect hormones which, like the 
ecdysteroids, act by complexing with insect members of the 
steroid receptor superfamily to control insect development, 
are likely candidates for pesticides with these desirable 
properties. 


From a different perspective, many medically and com- 
mercially important proteins can be produced in a usable 
form by genetically engineered bacteria. However, many 
expressed proteins are processed incorrectly in bacteria and 
are preferably produced by genetically engineered eucary- 
otic cells. Typically, yeast cells or mammalian tissue-culture 
cells are used. Because it has been observed that protein 
processing of foreign proteins in yeast cells is also fre- 
quently inappropriate, mammalian cultured cells have 
become the central focus for protein production. It is com- 
mon that the production of large amounts of foreign proteins 
makes these cells unhealthy, which may affect adversely the 
yield of the desired protein. This problem may be 
circumvented, in part, by using an inducible expression 
system. In such a system, the cells are engineered so that 
they do not express the foreign protein, and therefore are not 
unhealthy, until an inducing agent is added to the growth 
medium. In this way, large quantities of healthy cells can be 
produced and then induced to produce large amounts of the 
foreign protein. Unfortunately, in the presently available 
systems, the inducing agents themselves, such as metal ions 
or high temperature, adversely affect the cells, thus again 
lowering the yield of the desired foreign protein the cells 
produce. A need therefore exists for the development of 
innocuous inducing factors for efficient production of 
recombinant proteins. Such innocuous factors could also 
prove invaluable for human therapy, where the individual 
suffers from lack of the ability to produce particular proteins 
by using methods similar to those for producing proteins in 
cultured cells, such innocuous factors for inducing thee 
synthesis of the required protein could be used for control- 
ling both the timing and the abundance of the protein 
produced in the affected individual. 


The hormones that complex with mammalian or other 
vertebrate members of the steroid receptor superfamily are 
unlikely candidates for such innocuous factors, nor have 
they been found to satisfy the required properties of such 
factors, because mammalian cells contain these receptors, or 
highly homologous proteins, that would alter the expression 
of many target genes in the presence of the respective 
hormone, thereby adversely affecting the host cells. 


For these and other reasons, obtaining steroid receptors or 
nucleic acid information about them has been a goal of 
researchers for several years. Unfortunately, efforts have 
been unsuccessful despite significant investment of 
resources. The absence of information on the structure and 
molecular biology of steroid receptors has significantly 
hindered the ability to produce such products. 

Thus, there exists a need for detailed sequence informa- 
tion on insect members of the steroid receptor superfamily, 
and the genes that encode these receptors and for resulting 
reagents useful in finding new molecules which may act as 
agonists or antagonists of natural insect members of the 
steroid receptor superfamily, or as components of systems 
for highly specific regulation of recombinant proteins in 
mammalian cells. 


SUMMARY OF THE INVENTION 


In accordance with the present invention, isolated recom- 
binant nucleic acids are provided which, upon expression, 
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are capable of coding for other than a native vertebrate 
steroid receptor or fragment thereof. These nucleic acids 
typically comprise a segment having a sequence substan- 
tially homologous to one or more coding regions of domains 
A, B, D, E or F from an insect steroid receptor superfamily 
member gene having steroid binding domain homology. 
Preferably, the nucleic acids encode a polypeptide capable of 
binding to a ligand for an insect steroid receptor superfamily 
member and are capable of hybridizing to an insect steroid 
receptor superfamily member gene segment under selective 
hybridization conditions, usually stringent hybridization 
conditions. Mammalian cells transformed with the nucleic 
acids are also provided. 
In another embodiment, isolated recombinant nucleic 
acids are included that have sequence exhibiting identity 
over about 20 nucleotides of a coding segment of an insect 
steroid receptor superfamily member having steroid binding 
domain homology. The nucleic acids can be transformed 
into cells to express a polypeptide which binds to a control 
element responsive to a ligand of an insect steroid receptor 
superfamily. 
Alternatively, an isolated DNA molecule is provided 
comprising a DNA sequence capable of binding to an insect 
steroid receptor superfamily member other than 20-OH 
ecdysone receptor, such as DHR3, E75A or E75B. The DNA 
sequence may be present in an expression vector and pro- 
mote transcription of an operably linked sequence (e.g., 
encoding a polypeptide) in response to binding by an insect 
steroid receptor superfamily member. Also contemplated are 
recombinant nucleic acids comprising a controlling element 
responsive to a ligand of an insect steroid receptor super- 
family member ligand responsive controlling element (e.g., 
an alcohol dehydrogenase promoter), a non-heat shock pro- 
moter sequence (e.g., an alcohol dehydrogenase promoter) 
and a sequence comprising a reporter gene. 
Additional embodiments of the present invention include 
polypeptides comprising an insect steroid receptor super- 
family member or fragment thereof, wherein such polypep- 
tide is substantially free of naturally-associated insect cell 
components and exhibits a biological activity characteristic 
of an insect steroid receptor superfamily member with a 
hormone binding domain. Preferably, the insect steroid 
receptor superfamily member or fragment thereof also com- 
prises a DNA binding domain and the polypeptide is capable 
of binding to a hormone analogue selected from the group 
consisting of an insect hormone, an insect hormone agonist 
and an insect hormone antagonist. The polypeptide can 
comprise a zinc-finger domain and usually is capable of 
binding to a DNA controlling element responsive to an 
insect hormone. As desired, the polypeptide may be fused to 
a second polypeptide, typically a heterologous polypeptide 
which comprises a second steroid receptor superfamily 
member. 
Fragments of such polypeptides can have a sequence 
substantially homologous to consensus E1, E2 or E3 region 
sequences. By way of example, a preferred fragment has a 
sequence comprising: 
a segment at least about 25% homologous to a consensus 
E1 region sequence; 

a segment at least about 3076 homologous to a consensus 
E2 region sequence; and 

a segment at least about 3076 homologous to a consensus 
E3 region sequence. 

The polypeptides of the present invention have a variety 
of utilities. For example, a method for selecting DNA 
sequences capable of being specifically bound by an insect 
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steroid receptor superfamily member can comprise the steps 
of screening DNA sequences for binding to such polypep- 
tides and selecting DNA sequences exhibiting such binding. 
Alternatively, methods for selecting ligands specific for 
binding to a hormone binding domain of an insect steroid 
receptor superfamily member can comprise the steps of 
screening compounds for binding to one or more superfam- 
ily members and selecting compounds exhibiting specific 
binding to the members. Also included are methods for 
modulating insect physiology or development (e.g., killing) 
comprising the steps of screening compounds for binding to 
an insect steroid receptor superfamily member, selecting 
compounds exhibiting said binding and administering the 
ligand to an insect. 

Additionally provided are methods for selecting ligands 
specific for binding to a ligand binding domain of an insect 
steroid receptor superfamily member comprising combin- 
ing: 

(i) a fusion polypeptide which comprises a ligand binding 
domain functionally linked to a DNA binding domain 
of a second steroid receptor superfamily member; and 

(ii) a second nucleic acid sequence encoding a second 
polypeptide, wherein expression of the second nucleic 
acid sequence is responsive to binding by the DNA 
binding domain; 

screening compounds for an activity of inducing expres- 
sion of said second polypeptide; and 

selecting said compounds. 

Also provided are methods for producing a polypeptide 
comprising the steps of: 

selecting a cell, typically a mammalian or plant cell which 
is substantially insensitive to exposure of an insect 
steroid receptor superfamily ligand; 

introducing into said cell; 

(1) a receptor for the ligand; and 

(ii) a nucleic acid sequence encoding the polypeptide, 
the nucleic acid sequence operably linked to a con- 
trolling element responsive to presence of the 
selected ligand, wherein a transformed cell is pro- 
duced; and 


exposing the transformed cell to the ligand. 


BRIEF DESCRIPTION OF THE DRAWINGS 


FIG. 1. pMTEcR, a Cu**-inducible EcR expression plas- 
mid. The P,,7, EcR ORF and Act 5c poly A elements are 
defined in Experimental Example III, part A. The HYG” 
ORF confers hygromycin resistance and is under control of 
the promoter in the LTR of Drosophila transposable 
elements, copia. The SV40 intron/poly A element provides 
an intron for a possible splicing requirement, as well as a 
polyadenylation/cleavage sequence for the HYG' ORF 
mRNA. The pAT153 DNA derives from a bacterial plasmid. 

FIG. 2. The ecdysone-inducible pEcRE/Adh/pgal reporter 
plasmid. See the text of Experimental Example III, part B, 
for the construction of this plasmid and the definitions of all 
symbols (except the SV40 splice and poly A) which are 
defined in the figure legend. 

FIG. 3. The constitutive EcR expression plasmid, pAct- 
EcR. The construction of this plasmid and the definition of 
the symbols are given in Experimental Example III, part B. 


DETAILED DESCRIPTION OF THE 
PREFERRED EMBODIMENTS 


The present invention provides novel isolated nucleic acid 
sequences encoding polypeptide products exhibiting the 
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structure and/or activities of insect members of the steroid 
receptor superfamily. Having elucidated the structures of 
these insect steroid receptors from their genes, the separate 
ligand-binding domains and DNA-binding domains are used 
individually or in combination to screen for new ligands or 
DNA sequences which bind to these domains. Thus, for 
example, receptors may be used to control expression of 
reporter genes for which sensitive assays exist. Or, the 
hormone-binding domains serve as reagents for screening 
new molecules, useful as either agonists or antagonists of 
steroid receptor superfamily members. Either new classes of 
molecules may be screened, or selected modifications from 
known ligands may be used. These new ligands find use as 
highly specific and highly active, naturally occurring pesti- 
cides. Alternatively, structural information about interac- 
tions between the ligand and binding domains directs meth- 
ods for mutagenizing or substituting particular residues in 
the binding domains, thereby providing for altered binding 
specificity. Thus, inter alia, the present invention provides 
for screening for new ligand molecules, for the design of 
new ligand-binding domain interactions, for producing 
novel chimaeric steroid receptor superfamily members and 
for generating new combinations of ligands and binding 
domains. 

The present invention also provides for the isolation or 
identification of new steroid hormone-responsive elements 
and associated genes. By appropriate operable linkage of 
selected sequences to DNA controlling elements which are 
responsive to binding by the DNA-binding domains of 
steroid receptor superfamily members, new regulatory com- 
binations result. The present invention further provides for 
the design of either a binding domain in a member of the 
insect steroid receptor superfamily that will recognize given 
DNA sequences, or conversely for the modification of DNA 
sequences which will bind to particular DNA-binding 
domains. Both the DNA-binding domain of a superfamily- 
member polypeptide and its DNA recognition sequence can 
be coordinately modified to produce wholly new receptor- 
DNA interactions. 

In an alternative embodiment, a DNA-binding sequence 
recognized by a selected receptor may be operably linked to 
a desired genetic sequence for inducible expression. Thus, 
upon administration of a ligand specific for that selected 
receptor, the desired genetic sequence is appropriately regu- 
lated. Expression systems are constructed that are respon- 
sive to administration of insect steroid receptor superfamily- 
specific ligands. By identifying and isolating new members 
of the insect steroid receptor superfamily, new regulatory 
reagents become available, both with respect to usable 
hormones, and with respect to useable controlling elements. 

In another embodiment, highly regulatable expression of 
a gene may be achieved by use of regulatory elements 
responsive to ligands specific to the superfamily members. 
If transformed cells are grown under conditions where 
expression is repressed or not induced, the cells may grow 
to higher densities and enjoy less stressful conditions. Upon 
reaching high density, the regulatory ligand molecule will 
adjust to cause high expression. If the selected cells are 
otherwise insensitive to the inducing ligand, the cells will 
not be affected by exposure to the ligand used to regulate 
expression. This provides a means both for highly efficient 
regulatable expression of genes, and for introduction of 
these genes into intact organisms. 

In accordance with specific embodiments of the present 
invention, nucleic acid sequences encoding portions of 
insect steroid hormone receptor hormone receptor superfam- 
ily members have been elucidated. For example, certain 


US 6,245,531 B1 


7 


ecdysone receptor polypeptides have been isolated and 
characterized; specifically, DNA's encoding four different 
members of the Drosophila steroid receptor superfamily 
have been characterized. One is the 20-OH ecdysone 
receptor, also called the ecdysone receptor (EcR), for which 
a full-length encoding sequence has been determined. A 
second member is Drosophila hormone receptor 3 (DHR3), 
a protein with sequence homology to various steroid recep- 
tor superfamily members. The third and fourth members of 
the superfamily are E75A and E75B, closely related pro- 
teins. These members are encoded by segments of the same 
gene, and each possesses sequence homology to other ste- 
roid receptor superfamily members. 


The DNA sequences encoding each of these members of 
the insect steroid receptor superfamily provide probes for 
screening for homologous nucleic acid sequences, both in 
Drosophila and other genetic sources. This screening allows 
isolation of homologous genes from both vertebrates and 
invertebrates. Production of large amounts of the encoded 
proteins is effected by inserting those sequences into expres- 
sion systems. 


The EcR, DHR3, E75A and E75B genes are each linked 
to similar DNA sequences which likely function as 
controlling, or regulatory, elements. These controlling ele- 
ments are regulated in a fashion characteristic of response to 
binding by proteins homologous to members of the steroid 
receptor superfamily. The present invention provides for the 
isolation of these hormone-responsive control elements, and 
for their use in regulating gene expression. One embodiment 
of a DNA construct comprises: (1) multiple copies of an 
insect steroid receptor superfamily controlling element 
linked to (2) a minimal gene promoter, preferably not a heat 
shock gene promoter, which provides highly inducible 
expression of (3) an operably linked gene. This construct 
provides a very sensitive assay for the presence of the 
controlling molecule of the receptor. 

Another aspect of the present invention involves cells 
comprising: (1) isolated recombinant gene segments encod- 
ing biologically active fragments of insect steroid receptor 
superfamily proteins; (2) DNA sequences which bind insect 
steroid receptors, e.g., the elements involved in hormone- 
responsive control; or (3) modified receptor proteins. Prog- 
eny of cells which are transformed are included within 
transformed cells generally. In particular, the present inven- 
tion provides for a system whereby expression of polypep- 
tides is responsive to steroid induction. For instance, a 
system which expresses the desired protein in response to 
exposure to ecdysone analogues is constructed by operably 
linking an ecdysone-responsive enhancer to a peptide encod- 
ing segment. 

The present invention also provides insect steroid receptor 
proteins substantially free from naturally-associated insect 
cell components. Such receptors will typically be either 
full-length proteins, functional fragments, or fusion proteins 
comprising segments from an insect steroid receptor protein 
fused to a heterologous, or normally non-contiguous, protein 
domain. 

The present invention further provides a number of meth- 
ods for utilizing the subject receptor proteins. 

One aspect of the present invention is a method for 
selecting new hormone analogues. The isolated hormone- 
binding domains specifically bind hormone ligands, thereby 
providing a means to screen for new molecules possessing 
the property of binding with high affinity to the ligand- 
binding region. Thus, a binding domain of an insect steroid 
receptor superfamily member may be used as a reagent to 
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develop a binding assay. On one level, the binding domains 
can be used as affinity reagents for a batch or in a column 
selective process, to selectively retain ligands which find. 
Alternatively, a functional assay is preferred for its greater 
sensitivity to ligand-binding. By using a reporter molecule 
for binding, either through a direct assay for binding, or 
through an expression or other functional linkage between 
binding and another function, an assay for binding may be 
developed. For example, by operable linkage of an easily 
assayable reporter gene to a controlling element responsive 
to binding by an insect steroid receptor superfamily member, 
and where ligand-binding is functionally linked to protein 
induction, an extremely sensitive assay for the presence of 
a ligand or of a receptor results. Such a construct useful for 
assaying the presence of 20-OH ecdysone is described 
below. This construct is useful for screening for agonists or 
antagonists of the 20-OH ecdysone ligand. 

In particular, this method may be used to detect the ligand 
which bind to a receptor, i.e., an “orphan receptor," whose 
ligand is unknown. Binding domains with “unknown” 
ligands may originate from either newly identified insect 
steroid receptor superfamily members, or from mutagenesis. 
A hybrid receptor may be created with a ligand-binding 
domain and DNA-binding domain from different sources. 
This would allow screening for ligands for *orphan recep- 
tor" binding domains functionally linked to known DNA- 
binding domains which will control known reporter gene 
constructs as described below. This system for ligand- 
receptor binding provides and extremely sensitive assay for 
ligand-receptor interactions. 

Alternatively, the tertiary structure and spatial interactions 
between a ligand-binding domain from an insect steroid 
receptor superfamily member and its ligand will direct 
design for new combinations of ligand-binding domains 
with ligands. Either method provides for selecting highly 
specific and unusual ligands which may be bound only by a 
modification of a natural receptor polypeptide-binding 
domain. Alternatively, novel steroid hormone analogues 
may be selected which exhibit modified specificity for 
binding to a limited group of steroid receptors. 

The present invention also provides for new and useful 
combinations of the various related components. The recom- 
binant nucleic acid sequences encoding the polypeptides, the 
polypeptide sequences, and the DNA sites to which the 
receptors bind (i.e., the regulatory, or control, elements) 
together provide for combining particular components in 
novel fashions. For instance, upon expression, fusing 
nucleic acid sequences encoding peptides from different 
sources will provide polypeptides exhibiting hybrid proper- 
ties. In particular, hybrid receptors comprising segments 
from other members of the superfamily, or from other 
sources, will be made. Hybrid genetic constructs provide for 
genes exhibiting unusual control and expression character- 
istics. Combining an insect steroid receptor-responsive 
enhancer segment with a different polypeptide encoding 
segment will produce a steroid-responsive expression sys- 
tem for that polypeptide. 

The isolation of insect steroid receptors provides for 
isolation or screening of new ligands for receptor binding. 
Some of these will interfere with, or disrupt, normal insect 
development. It may sometimes be important to either 
accelerate or decelerate insect development, for instance, in 
preparing sterile adults for release. Alternatively, in certain 
circumstances, a delay or change in the timing of develop- 
ment may be lethal or may dramatically modify the ability 
of an insect to affect an agricultural crop. Thus, naturally 
occurring, biodegradable and highly active molecules to 
disrupt the timing of insect development will result. 
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Furthermore, these polypeptides provide the means by 
which antibodies have been raised. These antibodies possess 
specificity for binding to particular steroid receptor classes. 
Thus, reagents for determining qualitative or quantitative 
presence of these or homologous polypeptides may be 
produced. Alternatively, these antibodies may be used to 
separate or purify receptor polypeptides. 

Transcription Sequences of Insect Steroid Receptor Super- 
family Members 

The ecdysone receptor gene is a member of the steroid 
and thyroid hormone receptor gene superfamily. The steroid 
receptors and thyroid hormone receptors are components of 
a collective group of ligand-responsive transcription factors. 
See, Evans, Science 240:889-895 (1988), and Segraves, 
Molecular and Genetic Analysis of the E75 Ecdysone- 
Responsive Gene of Drosophila melanogaster (Ph.D. thesis, 
Stanford University 1988), both of which are hereby incor- 
porated herein by reference for all purposes. These receptors 
show extensive sequence similarity, especially in their *zinc 
finger" DNA-binding domains, and also in a ligand, or 
hormone, binding domain. Modulation of gene expression 
occurs apparently in response to receptor binding to specific 
control, or regulatory, elements in the DNA. The cloning of 
receptor cDNAs provides the first opportunity to study the 
molecular bases of steroid action. The steroid receptor 
superfamily is a class of receptors which exhibit similarities 
in structural and functional features. While the term insect is 
used herein, it will be recognized that the same methods and 
molecules may be derived form other species of animals, in 
particular, within the class Insecta, but more broadly should 
be applicable to all members of the phylum Arthropoda, 
which use ecdysteroids as hormones. Thus, although the 
term insect is used herein, it will be recognized that in some 
circumstances the larger group of arthropods may be also 
included. Members of the insect steroid receptor superfam- 
ily (superfamily) are characterized by functional domains 
involved in ligand-binding and DNA binding, both of which 
interact to effect a change in the regulatory state of a gene 
operably linked to the DNA-binding site of the receptor. 
Thus, the receptors of the insect steroid receptor superfamily 
seem to be ligand-responsive transcription factors. The 
receptors of the present invention exhibit at least a hormone- 
binding domain characterized by sequence homology to 
particular regions, labeled E1, E2 and E3. 

The members of the insect steroid receptor superfamily 
are typically characterized by structural homology of par- 
ticular domains, such as defined initially in the estrogen 
receptor. Specifically, a DNA-binding domain, C, and a 
ligand-binding domain, E, are separated and flanked by 
additional domains as identified by Krust et al. (Krust et al. 
(1986), EMBO J. 5:891—897), which is incorporated herein 
by reference. 

The C domain, or zinc-finger DNA-binding domain, is 
usually hydrophilic, having high cysteine, lysine and argi- 
nine content—a sequence suitable for the required tight 
binding. The E domain is usually hydrophobic and charac- 
terized as regions E1, E2 and E3. The ligand-binding 
domains of the present invention are typically characterized 
by having significant homology in sequence and structure to 
these three regions. Amino proximal to the C domain is a 
region initially defined as separate A and B domains. Region 
D separates the more conserved domains C and E. Region D 
typically has a hydrophilic region whose predicted second- 
ary structure is rich in turns and coils. The F region is 
carboxy proximal to the E region (see, Krust et al., supra). 

The ligand-binding domain of the members of the insect 
steroid receptor superfamily is typically carboxyl-proximal, 
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relative to a DNA-binding domain described below. See, 
Evans, Science 240:889—895. The entire hormone-binding 
domain is typically between about 200 and 250 amino acids 
but may be less. This domain has the subregions of high 
homology, termed the E1, E2 and E3 regions. See Table 4. 

The El region is 19 amino acids long with a consensus 
sequence AKX(L/DPGEXXLI(L/D(D/E)DOITLL, where X 
represents any amino acid and the other letters are the 
standard single-letter code. Positions in parentheses are 
alternatives. Typically, members of the insect steroid recep- 
tor superfamily will have at least about five matches out of 
the sixteen assigned positions, preferably at least about nine 
matches, and in preferred embodiments, at least about ten 
matches. Alternatively, these insect steroid receptor super- 
family members will have homologous sequences exhibiting 
at least about 3596 homology, preferably at least about 5596 
homology and more preferably at least about 60% to 70% 
homology at positions assigned preferred amino acids. 

The E2 region is a 19 amino-acid segment with a con- 
sensus sequence: 

E(F/YY( A/V)L/CO(L/M)KA(/L)( VIL)L(L/D(N/S(S/P)D 

(P/-)(R/K)(P/D)GL, 

where—represents an optional absence of an amino acid. 
Typically, an insect steroid receptor superfamily member 
will exhibit at least about six matches, preferably at least 
about eight matches and more preferably at least about nine 
matches. Alternatively, E2 sequences of insect steroid recep- 
tor superfamily members exhibit at least about 30% 
homology, preferably at least about 40% homology, and 
more preferably at least about 45% homology. 

The E3 region is a 12 amino-acid segment with a con- 
sensus sequence 

LXKLLXXLPDLR. 

The insect steroid receptor superfamily members will typi- 
cally show at least about four matches out of the nine 
assigned preferences in the E3 region, preferably at least 
about five matches and more preferably at least about six 
matches. Alternatively, over the assigned positions, mem- 
bers of the insect steroid receptor superfamily will typically 
exhibit at least about 45% homology, usually at least about 
55% homology and preferably at least about 65% homology. 

In preferred embodiments, the insect steroid receptor 
superfamily members will exhibit matching of at least about 
five positions in an E1 region, at least about six positions in 
an E3 region and at least about four positions in an E3 
region. Thus, a combination of all three regional sequence 
constraints is especially preferred. 

The DNA-binding domain of these insect steroid receptor 
superfamily members is characterized by a “zinc fingers” 
motif. See, Evans, Science 240:889-895. The domain is 
typically amino proximal to the ligand, or hormone, binding 
site. Typically, the DNA-binding domain of the insect ste- 
roid receptor superfamily members is characterized by clus- 
tering of basic residues, a cysrich composition and homol- 
ogy in sequence. See, Evans, R. M. (1988), Science 
240:889-89; and Experimental section below. Significant 
sequence homology among superfamily members exists. 
Typically, the insect steroid receptor superfamily members 
will exhibit at least about [30]% homology in the 67+1 
amino acid region of this domain, usually at least about 40% 
homology, and preferably at least about 45% homology. 

Steroids are derivatives of the saturated tetracyclic hydro- 
carbon perhydrocyclopentanophenanthrene. Among the 
molecules in the group “steroids” are the bile acids, cholic 
acid and deoxycholic acid, the adrenocortical steroids, such 
as corticosterone and aldosterone, the estrogens such as 
estrone and f-estradiol, the androgens, such as testosterone 
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and progesterone, and the ecdysteroids. The terms steroid or 
steroid hormones are used interchangeably herein and are 
intended to include all steroid analogues. Typically, steroid 
analogues are molecules which have minor modifications of 
various peripheral chemical groups. See, Koolman (ed.) 
(1989), cited above, for details on ecdysteroids. 

Although ligands for the insect steroid receptor superfam- 
ily members have historically been characterized as steroids, 
the term “steroid” in the label “Insect steroid receptor 
superfamily" is not meant literally. The use of *steroid" has 
resulted from a historical label of members of a group 
recognized initially to include only steroids. However, the 
limitation no longer is applicable. Thus, there may be 
members of the insect steroid receptor superfamily, as 
defined herein, whose ligand-binding specificity is not 
directed to *steroids." Typically, the ligands for members of 
the insect steroid receptor superfamily are lipophilic mol- 
ecules. 

The term *ligand" is meant herein to exclude the DNA 
sequence to which the DNA-binding domain binds. Thus, 
the term ligand is meant to refer to the molecules that bind 
the domain described here as the “hormone-binding 
domain." Also, a ligand for an insect steroid receptor super- 
family member is a ligand which serves either as the natural 
ligand to which the member binds, or a functional analogue 
which may serve as an agonist or antagonist. However, the 
functional term *hormone" is used, again, because of the 
historic usage to describe the receptors, but is meant to apply 
to virtually any chemical messenger used to communicate 
between cell types. These molecules are typically used in 
intercellular signal transduction, but are not limited to those 
molecules having slow or systemic effects. 

Substantial homology in the nucleic acid context means 
either that the segments, or their complementary strands, 
when compared, are identical when optimally aligned, with 
appropriate nucleotide insertions or deletions, in at least 
about 60% of the residues, usually at least about 80% and 
preferably at least 90% of the nucleotides. Alternatively, 
substantial homology exists when the segments will hybrid- 
ize under selective hybridization conditions, to a strand, or 
its complement, typically using a sequence derived from 
Table 1, 2 or 3. Selectivity of hybridization exists when 
hybridization occurs which is more selective than total lack 
of specificity. Typically, selective hybridization will occur 
when there is at least about 55% homology over a stretch of 
at least about 14/25 nucleotides, preferably at least about 
65%, more preferably at least about 75%, and most prefer- 
ably at least about 9096. See, Kanehisa, M. (1984), Nucleic 
Acids Res. 12:203-213, which is incorporated herein by 
reference. Stringent hybridization conditions will typically 
include salt concentrations of less than about 1 M, more 
usually less than about 500 mM and preferably less than 
about 200 mM. Temperature conditions will typically be 
greater than 20? C., more usually greater than about 30? C. 
and preferably in excess of about 37? C. As other factors 
may significantly affect the stringency of hybridization, 
including, among others, base composition and size of the 
complementary strands, presence of organic solvents and 
extent of base mismatching, the combination of parameters 
is more important than the absolute measure of any one. 

A gene for an insect steroid receptor superfamily member 
gene includes its upstream (e.g., promoter) and downstream 
operably linked controlling elements, as well as the comple- 
mentary strands. It also comprises the segment encoding the 
transcription unit, including both introns and exons. Thus, an 
isolated gene allows for screening for new steroid receptor 
genes by probing for genetic sequences which hybridize to 
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either controlling or transcribed segments of a receptor gene 
of the present invention. Three segments of particular inter- 
est are the controlling elements, both upstream and 
downstream, and segments encoding the DNA-binding seg- 
ments and the hormone-binding segments. 

Insect Steroid Receptor Superfamily Member Polypeptides 

A polypeptide sequence of the ecdysone receptor is rep- 
resented in Table 2. Other insect steroid receptor superfam- 
ily member polypeptide sequences are set forth in Tables 1 
and 3. Preferred nucleic acid sequences of the cDNAs 
encoding these insect steroid receptor superfamily member 
polypeptides are also provided in the corresponding tables. 
Other nucleic acids may be used to encode the proteins, 
making use of the degeneracy or non-universality of the 
genetic code. 

As used herein, the term “substantially pure” describes a 
protein or other material which has been separated from its 
native contaminants. Typically, a monomeric protein is 
substantially pure when at least about 60 to 7596 of a sample 
exhibits a single polypeptide backbone. Minor variants or 
chemical modifications typically share the same polypeptide 
sequence. Usually a substantially pure protein will comprise 
over about 85 to 90% of a protein sample, and preferably 
will be over about 99% pure. Normally, purity is measured 
on a polyacrylamide gel, with homogeneity determined by 
staining. Alternatively, for certain purposes high resolution 
will be necessary and HPLC or a similar means for purifi- 
cation will be used. For most purposes, a simple chroma- 
tography column or polyacrylamide gel will be used to 
determine purity. 

The term "substantially free of naturally-associated insect 
cell components" describes a protein or other material which 
is separated from the native contaminants which accompany 
it in its natural insect cell state. Thus, a protein which is 
chemically synthesized or synthesized in a cellular system 
different from the insect cell from which it naturally origi- 
nates will be free from its naturally-associated insect cell 
components. The term is used to describe insect steroid 
receptor superfamily members and nucleic acids which have 
been synthesized in mammalian cells or plant cells, E. coli 
and other procaryotes. 

The present invention also provides for analogues of the 
insect steroid receptor superfamily members. Such ana- 
logues include both modifications to a polypeptide backbone 
and variants and mutants of the polypeptides. Modifications 
include chemical derivatizations of polypeptides, such as 
acetylations, carboxylations and the like. They also include 
glycosylation modifications and processing variants of a 
typical polypeptide. These processing steps specifically 
include enzymatic modifications, such as ubiquinization. 
See, e.g., Hershko and Ciechanover (1982), *Mechanisms of 
Intracellular Protein Breakdown," Ann. Rev. Bioch., 
51:335-364. 

Other analogues include genetic variants, both natural and 
induced. Induced mutants may be derived from various 
techniques including both random mutagenesis using 
reagents such as irradiation or exposure to EMS, or may take 
the form of engineered changes by site-specific mutagenesis 
or other techniques of modern molecular biology. See, 
Sambrook, Fritsch and Maniatis (1989), Molecular Cloning: 
À Laboratory Manual (2d ed.), CSH Press. 

As described above, the DNA-binding zinc fingers seg- 
ment of a receptor shows high specificity of recognition of 
specific target DNA sequences. An understanding of the 
DNA protein-binding interactions provides for the modifi- 
cation in a rational manner either DNA or protein 
characteristics, or both, to effect specificity of binding for 
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modulation of enhancer activity. More importantly, isolation 
of genes for new members of the insect steroid receptor 
superfamily allows their use to produce the receptor 
polypeptides and to isolate and isolate new controlling 
elements. By using the DNA-binding domains, as described 
above, controlling elements which are responsive to the 
ligands bound by the corresponding superfamily members 
may be identified and isolated. This shall yield a variety of 
controlling elements responsive to ligands. By the methods 
described above, the ligands for any particular member of 
the insect steroid receptor superfamily may be identified. 

The controlling elements typically are enhancers, but may 
also include silencers or various other types of ligand- 
responsive elements. They may operate at large distances, 
but will typically be within about 50 kb, usually within about 
35 kb, more usually within about 20 kb and preferably 
within about 7 kb of the genes that these elements regulate. 
Polypeptide Fragments and Fusions 

Besides substantially full-length polypeptides, the present 
invention provides for biologically active fragments of the 
polypeptides. Significant biological activities include 
ligand-binding, DNA binding, immunological activity and 
other biological activities characteristic of steroid receptor 
superfamily members. Immunological activities include 
both immunogenic function in a target immune system, as 
well as sharing of immunological epitopes for binding, 
serving as either a competitor or substitute antigen for a 
steroid receptor epitope. 

For example, ligand-binding or DNA-binding domains 
may be “swapped” between different new fusion polypep- 
tides or fragments. Thus, new chimaeric polypeptides exhib- 
iting new combinations of specificities result from the 
functional linkage of ligand-binding specificities are DNA- 
binding domains. This is extremely useful in the design of 
inducible expression systems. 

For immunological purposes, immunogens may be pro- 
duced which tandemly repeat polypeptide segments, thereby 
producing highly antigenic proteins. Alternatively, such 
polypeptides will serve as highly efficient competitors for 
specific binding. Production of antibodies to insect steroid 
receptor superfamily members is described below. 

The present invention also provides for other polypeptides 
comprising fragments of steroid receptor superfamily mem- 
bers. Thus, fusion polypeptides between the steroid receptor 
segments and other homologous or heterologous proteins are 
provided. Homologous polypeptides may be fusions 
between different steroid receptor superfamily members, 
resulting in, for instance, a hybrid protein exhibiting ligand 
specificity of one member and DNA-binding specificity of 
another. Likewise, heterologous fusions may be constructed 
which would exhibit a combination of properties or activi- 
ties of the derivative proteins. Typical examples are fusions 
of a reporter polypeptide, e.g., luciferase, with another 
domain of a receptor, e.g., a DNA-binding domain, so that 
the presence or location of a desired ligand may be easily 
determined. See, e.g., Dull et al., U.S. Pat. No. 4,859,609, 
which is hereby incorporated herein by reference. Other 
typical gene fusion partners include “zinc finger” segment 
swapping between DNA-binding proteins, bacterial 
p-galactosidase, trpE Protein A, f-lactamase, alpha anylase, 
alcohol dehydrogenase and yeast alpha mating factor. See, 
e.g., Godowski et al. (1988), Science 241:812-816; and 
Experimental section below. 

Insect Steroid Receptor Superfamily Member Expression 

With the sequence of the receptor polypeptides and the 
recombinant DNA sequences encoding them, large quanti- 
ties of members of the insect steroid receptor superfamily 
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will be prepared. By the appropriate expression of vectors in 
cells, high efficiency production may be achieved. 
"Thereafter, standard purification methods may be used, such 
as ammonium sulfate precipitations, column 
chromatography, electrophoresis, centrifugation, crystalliza- 
tion and others. See various volumes of Methods in Enzy- 
mology for techniques typically used for protein purification. 
Alternatively, in some embodiments high efficiency of pro- 
duction is unnecessary, but the presence of a known induc- 
ing protein within a carefully engineered expression system 
is quite valuable. For instance, a combination of: (1) a 
ligand-responsive enhancer of this type operably linked to 
(2) a desired gene sequence with (3) the corresponding 
insect steroid receptor superfamily member together in an 
expression system provides a specifically inducible expres- 
sion system. Typically, the expression system will be a cell, 
but an in vitro expression system may also be constructed. 

The desired genes will be inserted into any of a wide 
selection of expression vectors. The selection of an appro- 
priate vector and cell line depends upon the constraints of 
the desired product. Typical expression vectors are described 
in Sambrook et al. (1989). Suitable cell lines may be selected 
from a depository, such as the ATCC. See, ATCC Catalogue 
of Cell Lines and Hybridomas (6th ed.) (1988); ATCC Cell 
Lines, Viruses, and Antisera, each of which is hereby 
incorporated herein by reference. The vectors are introduced 
to the desired cells by standard transformation or transfec- 
tion procedures as described, for instance, in Sambrook et al. 
(1989). 

Fusion proteins will typically be made by either recom- 
binant nucleic acid methods or by synthetic polypeptide 
methods. Techniques for nucleic acid manipulation are 
described generally, for example, in Sambrook et al. (1989), 
Molecular Cloning: A Laboratory Manual (2d ed.), Vols. 
1-3, Cold Spring Harbor Laboratory, which are incorporated 
herein by reference. Techniques for synthesis of polypep- 
tides are described, for example, in Merrifield, J. Amer. 
Chem. Soc. 85:2149-2156 (1963). 

The recombinant nucleic acid sequences used to produce 
fusion proteins of the present invention may be derived from 
natural or synthetic sequences. Many natural gene sequences 
are obtainable from various cDNA or from genomic libraries 
using appropriate probes. See, GenBank™, National Insti- 
tutes of Health. Typical probes for steroid receptors may be 
selected from the sequences of Tables 1, 2 or3 in accordance 
with standard procedures. Suitable synthetic DNA fragments 
may be prepared by the phosphoramidite method described 
by Beaucage and Carruthers, Tetra. Letts. 22:1859-1862 
(1981). A double stranded fragment may then be obtained 
either by synthesizing the complementary strand and anneal- 
ing the strand together under appropriate conditions or by 
adding the complementary strand using DNA polymerase 
with an appropriate primer sequence. 

With the isolated steroid receptor genes, segments of the 
transcribed segments may be used as probes for isolating 
homologous sequences from different sources, either differ- 
ent animals, or different but homologous genes exhibiting 
sequence homology. By selection of the segment used as a 
probe, particular functionally associated segments will be 
isolated. Thus, for example, other nucleic acid segments 
encoding either ligand-binding or DNA-binding domains of 
new receptors will be isolated. Alternatively, by using 
steroid-responsive controlling elements as a probe, new 
steroid-responsive elements will be isolated, along with the 
associated segment of DNA whose expression is regulated. 
This method allows for the isolation of ligand-responsive 
genes, many of which are, themselves, also members of the 
insect steroid receptor superfamily. 
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The natural or synthetic DNA fragments coding for a 
desired steroid receptor fragment will be incorporated into 
DNA constructs capable of introduction to and expression in 
an in vitro cell culture. Usually the DNA constructs will be 
suitable for replication in a unicellular host, such as yeast or 
bacteria, but may also be intended for introduction to, with 
and without and integration within the genome, cultured 
mammalian or plant or other eucaryotic cell lines. DNA 
constructs prepared for introduction into bacteria or yeast 
will typically include a replication system recognized by the 
host, the intended DNA fragment encoding the desired 
receptor polypeptide, transcription and translational initia- 
tion regulatory sequences operably linked to the polypeptide 
encoding segment and transcriptional and translational ter- 
mination regulatory sequences operably linked to the 
polypeptide encoding segment. The transcriptional regula- 
tory sequences will typically include a heterologous 
enhancer or promoter which is recognized by the host. The 
selection of an appropriate promoter will depend upon the 
host, but promoters such as the trp, lac and phage promoters, 
tRNA promoters and glycolytic enzyme promoters are 
known. See, Sambrook et al. (1989). Conveniently available 
expression vectors which include the replication system and 
transcriptional and translational regulatory sequences 
together with the insertion site for the steroid receptor DNA 
sequence may be employed. Examples of workable combi- 
nations of cell lines and expression vectors are described in 
Sambrook et al. (1989); see also, Metzger et al. (1988), 
Nature 334:31—36. 

Genetic Constructs 

The DNA segments encoding the members of the insect 
steroid receptor superfamily will typically be utilized in a 
plasmid vector. Two separate embodiments exist, the first 
having an expression control DNA sequence operably linked 
to the insect steroid receptor superfamily member coding 
sequences for expression of the insect steroid receptor 
superfamily member alone. A second includes an insect 
steroid receptor superfamily member as a component of an 
expression system for another gene to make expression of 
that other gene ligand responsive. This latter embodiment is 
separately described just below. The expression control 
sequences will be commonly eucaryotic enhancer or pro- 
moter systems in vectors capable of transforming or trans- 
fecting eucaryotic host cells. Once the vector has been 
incorporated into the appropriate host, the host, depending 
on the use, will be maintained under conditions suitable for 
high level expression of the nucleotide sequences. 
Steroid-responsive Expression of Selected Genes 

For steroid-responsive expression of other genes, the 
steroid receptor gene will typically be cotransformed with a 
recombinant construct comprising a desired gene for expres- 
sion operably linked to the steroid-responsive enhancer or 
promoter element. In this use, a single expression system 
will typically comprise a combination of (1) a controlling 
element responsive to a ligand of an insect steroid receptor 
superfamily member, (2) a desired gene for expression, 
operably linked to the controlling element, and (3) an insect 
steroid receptor superfamily member which can bind to the 
controlling element. Usually, this system will be within a 
cell, but an in vitro system is also possible. The insect steroid 
receptor superfamily member will typically be provided by 
expression of a nucleic acid encoding it, though it need not 
be expressed at particularly high levels. Thus, in one pre- 
ferred embodiment, the system will be achieved through 
cotransformation of a cell with both the regulatable con- 
struct and another segment encoding the insect steroid 
receptor superfamily member. Usually, the controlling ele- 
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ment will be an enhancer element, but it may work in reverse 
and be used to repress expression. In this embodiment, the 
ligand for the insect steroid receptor superfamily member 
will be provided or withheld as appropriate for the desired 
expression properties. 

A particularly useful genetic construct comprises an alco- 
hol dehydrogenase promoter operably linked to an easily 
assayable reporter gene, e.g., p-galactosidase. In a preferred 
embodiment of this construct, a multiplicity of copies of the 
insect steroid receptor superfamily member is used. For 
example, operable linkage of controlling elements respon- 
sive to insect steroid receptor superfamily members, e,g., 
EcR, DHR3, E75A and E75B, to the alcohol dehydrogenase 
(ADH) promoter, or others as described above, and protein 
coding sequences for a particular reporter protein, as 
described above leads to steroid-responsive expression of 
that protein. This controlling element responsive to the 
construct provides a very sensitive system for the detection 
of responsive expression. This will be used in sensitive 
assays for the presence of a receptor-ligand interaction, 
allowing for detection of either ligand or receptor or both. 

DNA sequences will normally be expressed in hosts after 
the sequences have been operably linked to (i.e., positioned 
to ensure the functioning of) an expression control sequence. 
These expression vectors are typically replicable in the host 
organisms either as episomes or as an integral part of the 
host chromosomal DNA. Commonly, expression vectors 
will contain selection markers, e.g., tetracycline or 
neomycin, to permit detection of those cells transformed 
with the desired DNA sequences (see, e.g., U.S. Pat. No. 
4,704,362, which is incorporated herein by reference). 

E. coli is one procaryotic host useful for cloning the DNA 
sequences of the present invention. Other microbial hosts 
suitable for use include bacilli, such as Bacillus subtilis, and 
other enterobacteriaceae, such as Salmonella, Serratia, and 
various Pseudomonas species. 

Other eucaryotic cells may be used, including yeast cells, 
insect tissue culture cells, avian cells or the like. Preferably, 
mammalian tissue cell culture will be used to produce the 
inducible polypeptides of the present invention (see, 
Winnacker, From Genes to Clones, VCH Publishers, N.Y. 
(1987), which is incorporated herein by reference). Mam- 
malian cells are preferred cells in which to use the insect 
steroid receptor superfamily member ligand-responsive 
gene constructs because they naturally lack the molecules 
which confer responses to the ligands for insect steroid 
receptor superfamily members. 

Since mammalian cells are insensitive to many ligands for 
insect steroid receptor superfamily members, exposure of 
these cells to the ligands of the insect steroid receptor 
superfamily members typically will have negligible physi- 
ological or other effects on the cells, or on a whole organism. 
This insensitivity of the cells to the ligands provides pre- 
ferred combination of ligand induction with an otherwise 
insensitive cell. This provides for transformation of insen- 
sitive cells with the controlling element operably linked to a 
derived gene, resulting in an expression system whose 
ligand for eliciting response causes minimal physiological 
effects. Therefore, cells can grow and express substantially 
unaffected by the presence of the ligand. The ligand may 
cause response either in the positive or negative direction. 
For example, cells might be desired to be grown to high 
density before expression. In a positive induction system, 
the inducing ligand would be added upon reaching high 
density, but since the ligand itself is innocuous to the cells, 
the only physiological imbalances result from the expression 
itself. Alternatively, in a negative repression system, the 
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ligand is supplied until the cells reach a high density, but 
again, the presence of the ligand is innocuous. Upon reach- 
ing a high density, the ligand would be removed. Introduc- 
tion of these cells into whole organisms may be performed 
so that the products of expression may be provided to the 
whole organism. In this circumstance, the natural insensi- 
tivity of cells to the ligands will also be advantageous. 

Expression vectors for these cells can include expression 
control sequences, such as an origin of replication, a 
promoter, an enhancer and necessary processing information 
sites, such as ribosome-binding sites, RNA splice sites, 
polyadenylation sites, and transcriptional terminator 
sequences. Preferably, the enhancers or promoters will be 
those naturally associated with genes encoding the steroid 
receptors, although it will be understood that in many cases 
others will be equally or more appropriate. Other preferred 
expression control sequences are enhancers or promoters 
derived from viruses, such as SV40, Adenovirus, Bovine 
Papilloma Virus, and the like. 

Similarly, preferred promoters are those found naturally 
in immunoglobulin-producing cells (see, U.S. Pat. No. 
4,663,281, which is incorporated herein by reference), but 
SV40, polyoma virus, cytomegalovirus (human or murine) 
and the LTR from various retroviruses (such as murine 
leukemia virus, murine or Rous sarcoma virus and HIV) 
may be utilized. See, Enhancers and Eukaryotic Gene 
Expression, Cold Spring Harbor Press, N.Y., 1983, which is 
incorporated herein by reference. 

The vectors containing the DNA segments of interest 
(e.g., the steroid receptor gene, the recombinant steroid- 
responsive gene, or both) can be transferred into the host cell 
by well-known methods, which vary depending on the type 
of cellular host. For example, calcium chloride transfection 
is commonly utilized for procaryotic cells, whereas calcium 
phosphate treatment may be used for other cellular hosts. 
(See, generally, Sambrook et al. (1989), Molecular Cloning: 
A Laboratory Manual (2d ed.), Cold Spring Harbor Press, 
which is incorporated herein by reference.) The term “trans- 
formed cell" is meant to also include the progeny of a 
transformed cell. 

As with the purified polypeptides, the nucleic acid seg- 
ments associated with the ligand-binding segment and the 
DNA-binding segment are particularly useful. These gene 
segments will be used as probes for screening for new genes 
exhibiting similar biological activities, though the control- 
ling elements of these genes are of equal importance, as 
described below. 

Many types of proteins are preferentially produced in 
eucaryotic cell types because of abnormal processing or 
modification in other cell types. Thus, mammalian proteins 
are preferably expressed in mammalian cell cultures. Effi- 
cient expression of a desired protein may be achieved, as 
described above, by placing: (1) a desired protein encoding 
DNA sequence adjacent to controlling elements responsive 
to ligands for insect steroid receptor superfamily members 
and an appropriate promoter. Furthermore, unhealthy cells 
are particularly difficult to maintain alive and efficiency of 
expression of exogenous proteins falls. Inducible expression 
systems partly solve this problem, but the presently avail- 
able inducing molecules have direct side effects on the cells. 
By selecting an inducing molecule which otherwise has no 
effects on the cell, a more natural physiological state of the 
cells may be achieved in growing the cells to high density. 
Upon exposure to such an inducing molecule, the cells 
initially in a healthy state will produce the desired protein at 
high levels without the harmful effects resulting from the 
action of the inducing molecule itself. Ecdysteroids and 
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other ligands for insect steroid receptor superfamily mem- 
bers are not normally found in mammalian cells, and thus 
serve as favorable candidates for a role as innocuous induc- 
ing molecules. Cyclic pulses of ligands in a cell culture may 
provide periods for cells to recover from effects of produc- 
tion of large amounts of exogenous protein. 

Additional steroid responsive gene elements have also 
been isolated using the techniques of the present invention. 
Other genes adjacent to, and operably linked to, steroid 
responsive gene controlling elements are selectable by locat- 
ing DNA segments to which steroid receptors specifically 
bind or by hybridization to homologous controlling ele- 
ments. For example, other steroid responsive genes have 
been isolated. Many of the genes which are ligand- 
responsive may also be new members of the insect steroid 
receptor superfamily. 

Having provided for the substantially pure polypeptides, 
biologically active fragments thereof and recombinant 
nucleic acids comprising genes for them, the present inven- 
tion also provides cells comprising each of them. By appro- 
priate introduction techniques well known in the field, cells 
comprising them may be produced. See, e.g., Sambrook et 
al. (1989). 

In particular, cells comprising the steroid responsive 
controlling elements are provided, and operable linkage of 
standard protein encoding segments to said controlling ele- 
ments produce steroid responsive systems for gene expres- 
sion. Cells so produced may be introduced into intact 
organisms, for example, plants, insects (including caterpil- 
lars and larvae) and animals. This may provide for a form of 
regulable expression of desired genes but where the regu- 
lating ligand has no other effects on the cells because they 
otherwise lack the receptors and responsive genes. For 
example, plants the receptors and responsive genes. For 
example, plants may be induced to fruit at desired times by 
administration of the appropriate ligand, or animals may be 
ligand-responsive in production of particular products. And, 
in fact, biochemical deficiencies may be overcome by 
ligand-responsive expression of cells introduced into an 
intact organism which, itself, also otherwise lacks genes 
responsive to the presence of such a ligand. Cells containing 
these expression systems may be used in gene therapy 
procedures, including in humans. 

Once a sufficient quantity of the desired steroid receptor 
polypeptide has been obtained, the protein may be used for 
various purposes. À typical use is the production of anti- 
bodies specific for binding to steroid receptors. These anti- 
bodies may be either polyclonal or monoclonal and may be 
produced by in vitro or in vivo techniques. 

For production of polyclonal antibodies, an appropriate 
target immune system is selected, typically a mouse or 
rabbit. The substantially purified antigen is presented to the 
immune system in a fashion determined by methods appro- 
priate for the animal and other parameters well known to 
immunologists. Typical sites for injection are in the 
footpads, intramuscularly, intraperitoneally, or intrader- 
mally. Of course, another species may be substituted for a 
mouse or rabbit. 

An immunological response is usually assayed with an 
immunoassay. Normally such immunoassays involve some 
purification of a source of antigen, for example, produced by 
the same cells and in the same fashion as the antigen was 
produced. The immunoassay may be a radioimmunoassay, 
an enzyme-linked assay (ELISA), a fluorescent assay, or any 
of many other choices, most of which are functionally 
equivalent but may exhibit advantages under specific con- 
ditions. 


US 6,245,531 B1 


19 


Monoclonal antibodies with affinities of 10° M^! prefer- 
ably 10? to 1019, or stronger will typically be made by 
standard procedures as described, e.g., in Harlow and Lane 
(1988), Antibodies: A Laboratory Manual, Cold Spring 
Harbor Laboratory; or Goding (1986), Monoclonal Antibod- 
ies: Principles and Practice (2d ed) Academic Press, New 
York, which are hereby incorporated herein by reference. 
Briefly, appropriate animals will be selected and the desired 
immunization protocol followed. After the appropriate 
period of time, the spleens of such animals are excised and 
individual spleen cells fused, typically, to immortalized 
myeloma cells under appropriate selection conditions. 
Thereafter the cells are clonally separated and the superna- 
tants of each clone are tested for their production of an 
appropriate antibody specific for the desired region of the 
antigen. 

Other suitable techniques involve in vitro exposure of 
lymphocytes to the antigenic polypeptides or alternatively to 
selection of libraries of antibodies in phage or similar 
vectors. See, Huse et al., (1989) “Generation of a Large 
Combinatorial Library of the Immunoglobulin Repertoire in 
Phage Lambda," Science 246:1275—1281, hereby incorpo- 
rated herein by reference. 

The polypeptides and antibodies of the present invention 
may be used with or without modification. Frequently, the 
polypeptides and antibodies will be labeled by joining, 
either covalently or non-covalently, a substance which pro- 
vides for a detectable signal. A wide variety of labels and 
conjugation techniques are known and are reported exten- 
sively in both the scientific and patent literature. Suitable 
labels include radionuclides, enzymes, substrates, cofactors, 
inhibitors, fluorescens, chemiluminescers, magnetic par- 
ticles and the like. Patents, teaching the use of such labels 
include U.S. Pat. Nos. 3,817,837; 3,850,752; 3,939,350; 
3,996,345; 4,277,437; 4,275,149; and 4,366,241. Also, 
recombinant immunoglobulins may be produced, see 
Cabilly, U.S. Pat. No. 4,816,567. 

Another use of purified receptor polypeptides is for deter- 
mination of the structural and biosynthetic aspects of the 
polypeptides. Structural studies of interactions of the ligand- 
binding domains with selected ligands may be performed by 
various methods. The preferred method for structural deter- 
mination is X-ray crystallography but may include various 
other forms of spectroscopy or chromatography. See, e.g., 
Connolly, M. L., J. Appl. Crystall., 16:548 (1983); and 
Connolly, M. L., Science 221:709 (1983), which are hereby 
incorporated herein by reference. For example, the structure 
of the interaction between hormone ligand and hormone- 
binding segments may be determined to high resolution. 
From this information, minor substitutions or modifications 
to either or both of the ligand and ligand-binding segment 
may be made. This information enables the generation of 
modified interactions between a ligand and its binding 
segment to either increase or decrease affinity of binding and 
perhaps increase or decrease response to binding. Likewise, 
the interaction between the zinc fingers DNA-binding seg- 
ments with the specific nucleic acid-binding sequence may 
be similarly modified. 

As a separate and additional approach, isolated ligand- 
binding polypeptide domains may be utilized to screen for 
new ligands. This permits screening for new agonists or 
antagonists of a particular steroid receptor. Isolated DNA- 
binding segments may be used to screen for new DNA 
sequences which will specifically bind to a particular 
receptor-binding segment. Typically, these receptor-specific 
binding sites will be controlling elements for steroid respon- 
sive genes. Thus, having isolated these DNA-binding 
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sequences, genes which are responsive to the binding of a 
given receptor can be isolated. This provides a method for 
isolating genes which are responsive to induction or inhi- 
bition by a given hormone receptor. 

In another aspect of the present invention, means for 
disrupting insect development are provided where new 
ligand agonists or antagonists are discovered. These com- 
pounds are prime candidate as agonists or antagonists to 
interfere with the normal insect development. By application 
of new steroid analogues of ligands for insejct steroid 
receptor superfamily members, it is possible to modify the 
normal temporal sequence of developmental events. For 
example, accelerating insect development will minimize 
generation time. This may be very important in circum- 
stances where large numbers of insects are desired finally, 
for instance, in producing sterile males in Mediterranean fly 
infestations. Alternatively, it may be useful to slow devel- 
opment in a pest infestation, such that the insects reach 
destructive stages of development only after commercial 
crops may have passed sensitive stages. 

In another commercial application, ligands discovered by 
methods provided by the present invention may be used in 
the silk-production industry. Here, the silkworms are artifi- 
cially maintained in a silk-producing larvae stage, thereby 
being silk productive for extended time periods. The devel- 
opment of larvae may also be accelerated to reach the 
silk-producing stage in their life cycle earlier than naturally. 

Other analogues of ligands for insect steroid receptor 
superfamily members may be selected which, upon 
application, may be completely disruptive of normal 
development, leading to a lethal result. However, the use of 
slightly modified natural substances will often have greater 
specificity of action and much higher activities, thus allow- 
ing for lower levels of application. Also, because the ligands 
may be more lipophilic, they may be more readily absorbed 
directly into the insect surface or article. Extremely low 
amounts of natural ligands may be effective in controlling 
pests. Furthermore, many of these ligands are likely top be 
relatively easily manufacture, perhaps by biological meth- 
ods using enzymatic production methods. There may be new 
ligands for insect steroid receptor superfamily members 
which may be more species specific or may exhibit a 
particularly useful spectrum of effectiveness, for example, 
being lethal to harmful insects. The greater specificity of the 
hormones will allow avoidance of use of non-specific pes- 
ticides possessing undesired deleterious ecological side 
effects. For instance residue of pesticides accumulate in 
food, often having deleterious effects on humans. 
Furthermore, compounds having structures closely analo- 
gous to natural compounds may be susceptible to natural 
mechanisms of biological degradation. 

Another aspect of the present invention provides for the 
isolation or design of new gene segments which are respon- 
sive to ligands for insect steroid receptor superfamily mem- 
bers. For example, use of the nucleic acids to screen for 
homologous sequences by standard techniques will provide 
genes having similar structural features. Similarly arranged 
intron structures will typically be characteristic of larger 
superfamily categories. The preferred domains for screening 
will be the ligand-binding or DNA-binding segments, 
however, the DNA segments which are recognized by the 
DNA-binding domains, i.e., the controlling elements, will 
also be of particular interest. By screening for new control- 
ling elements, by either sequence homology to other known 
ones, or by screening with the DNA zinc finger-binding 
domains of other receptors, additional receptors can be 
isolated. Receptors and genes important in the general 
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developmental sequence of expression will be discovered. B. Genetic Regulatory Activity of the EcR Protein in vivo 
Using this set of developmentally regulated genes will allow Methods 
selection of particular molecules which are responsible for Construction of the pAdh/Bgal, pEcRE/Adh/fgal and 
controlling expression of developmentally regulated genes. pActEcR plasmids 


The following experimental section is offered by way of 5 


he Transfection and generation of the cell line SRS 1.5 
example and not by limitation. 


C. Specific Binding of the EcR Protein to Ecdysone 


Response Elements 
EXPERIMENTAL Methods 
EXAMPLE I jọ Conditions for the DNA binding assay 
CLONING STRUCTURE AND EXPRESSION OF EXAMPLE IV 
THE DROSOPHILA E75 GENE THAT ENCODES 
TWO MEMBERS OF THE STEROID RECEPTOR GENE MUTAGENESIS 
RECEPTOR SUPERFAMILY P. Deletion Mutations 
A. Cloning of Genomic DNA Encompassing the Ecdysone- B. E75 Mutations Generated by Ethyl Methane Sulfonate 
Inducible 75B Puff Locus Methods 
Methods Strains, markers and chromosomes 
Genomic DNA libraries Quantitative Southern blot mapping for detection of 
In situ hybridization 20 mutant lesions 
B. Identification of a 50-kb Region of Cloned Genomic Molecular cloning of mutant lesions 
DNA that Contains Sequences Homologous to Ecdysone- Gamma ray mutagenesis 
induced Transcripts EMS mutagenesis 
Methods | res . : 
. k In situ hybridization and cytological analysis 
Organ culture and RNA isolation 25 
Southern blot analysis EXPERIMENTAL 


C. The E75 Gene Contains Two Overlapping Transcription 


Units: E75A and E75B EXAMPEET 
Methods 56 CLONING STRUCTURE AND EXPRESSION OF 
cDNA libraries THE DROSOPHILA E75 GENE THAT ENCODES 
Northern blot analysis TWO MEMBERS OF THE STEROID 
S1 nuclease protection and primer extension analysis RECEPTOR SUPERFAMILY 
DNA sequence analysis The following experiments demonstrate that the E75 gene 
D. The E75 Gene Encodes Two Members of the Steroid 35 encodes two members of the steroid receptor superfamily. 
Receptor Superfamily This is due to the receptor amino acid sequence homology 
Methods to the conserved DNA-binding and ligand-binding domains 
Protein sequence analysis of this superfamily, and that E75 is an ecdysone-inducible 
E. Expression Vectors for E75 Proteins gene that occupies and is responsible for the ecdysone- 
40 inducible early puff at the 75B locus in the Drosophila 
EXAMPLE II polytene chromosome. 
A. Cloning of Genomic DNA Encompassing the Ecdysone- 
CLONING, STRUCTURE AND EXPRESSION Inducible 75B Puff Locus 
OF THE EcR AND DHR3 GENES THAT We have used the method of chromosomal walking 
ENCODE ADDITIONAL MEMBERS OF THE 45 (Bender, W., P. Spierer, and D. S. Hogness, 1983. Chromo- 
STEROID RECEPTOR SUPERFAMILY somal walking and jumping to isolate DNA from the Ace 
A. Identification and Chromosomal Mapping of EcR and and rosy loci and the Bithorax complex in Drosophila 
DHR3 Genomic Clones melanogaster. J. Mol. Biol. 168:17—33) to isolate the 
B. Structure of the EcR and DHR3 Genes and their cDNAs genomic DNA encompassing the 75B puff region. The 
Methods 50 starting point for the walk was a genomic clone, 48253 (a 
Isolation of cDNA and additional genomic clones gift of J. Burke), which had been localized by in situ 
DNA sequence analysis hybridization to the proximal end of 75B. Isolated restriction 
C. The Predicted Amino Acid Sequence of the EcR and fragments of 48253 were used to screen a library of genomic 
DHR3 Proteins and their Implications DNA from the Canton S (C?) strain of D. melanogaster 
D. In Situ Labeling of the EcR and DHR3 Proteins with 55 (Maniatis, T., R. C. Hardison, E. Lacy, J. Lauer, C. 
Antibodies Induced by Proteins Produced in E. coli O'Connell, D. Quon, G. K. Sim, and A. Efstradiatis, 1978. 
The isolation of structural genes from libraries of eucaryotic 
EXAMPLE III DNA. Cell 15:687—701). Genomic clones AcDm3504 and 


AcDm3505 were isolated by homology to 28253. 

The walk was then extended in both directions until ~100 
kb of genomic DNA had been isolated, when the orientation 
of the walk was determined by in situ hybridization of the 
Mr : terminal segments to polytene chromosomes. Thereafter, the 
A. The EcR Protein Binds Ecdysteroids walk was extended in the rightward direction on the molecu- 

Methods 65 lar map, or distally relative to the centromere. The 350 kb of 

Extracts genomic DNA encompassed by the walk corresponds to the 

Hormone-binding assays chromosomal region between bands 75A6-7 and 75B11-13, 


THE ECDYSTEROID-BINDING, DNA-BINDING 60 
AND GENETIC REGULATORY PROPERTIES 
OF THE EcR PROTEIN DEMONSTRATE THAT 
IT IS AN ECDYSONE RECEPTOR 
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as determined by in situ hybridization. This region includes 
the 75B puff, which appears to initiate by simultaneous 
decondensation of chromosomal bands 75B3-5 and then 
spreads to surrounding bands. 

Methods 

Genomic DNA Libraries 

Canton S genomic DNAs were isolated from a library of 
sheared, EcoRI-linkered Canton S DNA cloned into the 
Charon 4 À phage vector (Maniatis, T., R. C. Hardison, E. 
Lacy, J. Lauer, C. O'Connell, D. Quon, G. K. Sim, and A. 
Efstradiatis, 1978. The isolation of structural genes from 
libraries of eucaryotic DNA. Cell 15:687—701). O" genomic 
DNAs were isolated from a library of sheared DNA, 
GC-tailed into the sep6 À vector (Meyerowitz, F. M., and D. 
S. Hogness, 1982. Molecular organization of a Drosophila 
puff site that responds to ecdysone. Cell 28:165-176). One 
step in the chromosomal walk was taken using a cosmid 
library (prepared in collaboration with S. Gemeraad) of Sau 
IIa partially digested O” DNA cloned into the cosmid 
p14B1 by the method of Ish-Horowicz and Burke (Ish- 
Horowicz, D., and J. F. Burke, 1982. Rapid and efficient 
cosmid cloning. Nucleic Acids Res. 9:2089—2998). 

In Situ Hybridization 

In situ hybridization to polytene chromosomes was car- 
ried out with DNA probes that were nick-translated in the 
presence of ?H-labeled TTP (NEN), as described by Bonner 
and Pardue (Bonner, J. J., and M. L. Pardue, 1976. 
Ecdysone-stimulated RNA synthesis in imaginal discs of 
Drosophila melanogaster. Assay by in situ hybridization. 
Chromosoma 58:87—99), with the following modifications: 
Heat and RNAase treatments of the slides were omitted, and 
hybridization and washing were at 63? C. in 2xSSPE for 18 
and 2 hours, respectively. 

B. Identification of a 50 kb Region of Cloned Genomic DNA 
that Contains Sequences Homologous to Ecdysone-induced 
Transcripts 

Restriction fragments of the above genomic clones were 
tested for their ability to hybridize with each of two cDNA 
probes: one derived from the RNA in ecdysone-induced 
cells, and the other from the RNA in noninduced cells. Two 
such differential screens were carried out. In the first, 
genomic DNA covering the entire 350 kb walk was exam- 
ined with cDNA probes synthesized with reverse tran- 
scriptase from an oligo(dT) primer annealed to poly(A)+ 
RNA. The poly(A)+ RNA was prepared from total inner 
tissues that were mass-isolated from late third instar larvae 
and incubated in the presence of ecdysone plus 
cycloheximide, or cycloheximide alone. (See Methods, 
below. Cycloheximide was included because higher levels 
of ecdysone-induced transcripts accumulate in its presence.) 

Each of the ??P-labeled cDNA probes made from these 
two poly(A)+ RNAs was applied to one of two duplicate 
Southern blots that contained, in addition to the genomic 
fragments from the walk, a control DNA consisting of 
sequences from the ribosomal protein 49 gene (O'Connell, 
P., and M. Rosbash, 1984. Sequence, structure and codon 
preference of the Drosophila ribosomal protein 49 gene. 
Nucleic Acids Res. 12:5495-5513), which was used to 
normalize the hybridization intensities of the duplicate blots. 
This screen revealed sequences specific to ecdysone- 
induced RNAs only within the AcDm3522 genomic clone 
that is centered at approximately +220 kb on the molecular 
map. 

Because the above probes will preferentially detect 
sequences near the 3' termini of the RNAs, particularly in the 
case of long transcripts, a second differential screen was 
carried out with cDNA probes primed with random hexam- 
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ers (see Methods, below). This screen, which was restricted 
to the 135 kb of genomic DNA between +105 kb and +240 
kb, revealed ecdysone-inducible sequences in fragments 
spread out over an —50 kb region between +170 kb and +220 
kb. This region represents the E75 gene. 

Methods 

Organ Culture and RNA Isolation 

Late third instar O” larvae were harvested, washed in 
0.7% NaCl, resuspended in Robb's phosphate-buffered 
saline (PBS) (Robb, J. A., 1968. Maintenance of imaginal 
discs of Drosophila melanogaster in chemically defined 
media. J. Cell. Biol. 41:876—885), preaerated with a blender, 
and passed through a set of rollers to extrude the organs. 
This “grindate” was filtered through a coarse Nitex screen to 
remove carcasses, and settled five times (3-5 minutes per 
settling) by gravity to remove floating and microscopic 
debris. Isolated tissues (primarily salivary glands, imaginal 
discs, gut, and Malphigian tubules) were cultured at 25? C. 
in plastic petri dishes in aerated Robb’s PBS. f-ecdysone 
(Sigma) (0.2 ug/ml of 10 mg/ml) in ethanol and/or cyclo- 
heximide (2 ug/ml of 35 mM) in water was added to the 
appropriate cultures. Incubations in the presence of cyclo- 
heximide were for ~8 hours. Isolated tissues were homog- 
enized in 10 volumes of 6 M guanidine-HC1/0.6 M sodium 
acetate (pH 5.2), centrifuged at 5000 g for 10 minutes to 
remove debris, and layered onto a 5.7 M CaCl shelf, as 
described previously (Chirgwin, J. M., A. E. Przbyla, R. J. 
MacDonald, and W. J. Rutter, 1979. Isolation of biologically 
active ribonucleic acid from sources enriched in ribonu- 
clease. Biochemistry 18:5294—5299). Poly(A)+ RNA was 
purified by oligo(dT) chromatography. 

Southern Blot Analysis 

Southern blots were performed on nitrocellulose, as 
described previously (Segraves, W. A., C. Louis, S. Tsubota, 
P. Schedl, J. M. Rawls, and B. P. Jarry, 1984. The rudimen- 
tary locus of Drosophila melanogaster. J. Mol. Biol. 
175:1-17). cDNA probes were prepared by reverse tran- 
scription (AMV reverse transcriptase; Seikagaku) of 2 ug of 
poly(A)+ RNA with 700 ng of oligo(dT)!?"?? (Collaborative 
Research) or 15 ug of random hexamers (Pharmacia) in a 20 
ul reaction mixture containing 80 mM Tris Cl (pH 8.3 at 42? 
C.), 10 mM MgCL, 100 mM KCI, 0.4 mM DTT, 0.25 mM 
each of DATP, dGTP, and dTTP, and 100 Ci of ['^PIdCTP 
(800 Ci/mole; Amersham). After incubation at 37? C. for 45 
minutes, 80 ul of 10 mM EDTA and 2 ul of 5 N NaOH were 
added before incubation at 70? C. for 10 minutes to denature 
the products and hydrolyze the RNA. After the addition of 
10 wl of 1 M Tris Cl (pH 7.5) and 5 wl of 1 N HCl, 
unincorporated label was removed by chromatography on 
Biogel P60. 
C. The E75 Gene Contains Two Overlapping Transcription 
Units: E75A and E75B 

Northern blot analysis of ecdysone-induced and nonin- 
duced RNAs, prepared as described above and hybridized 
with strand-specific DNA probes derived from cloned 
restriction fragments in the 60 kb region (+166 to +226 kb) 
containing the E75 gene, demonstrated that this gene pro- 
duces two classes of ecdysone-inducible mRNAs, both 
derived from rightward transcription. The E75A class of 
mRNAs hybridized with probes from both the 5' (left) and 
3' (right) ends of the 50 kb E75 gene. The E75B class 
hybridized only with probes from the 3' proximal 20 kb of 
the gene. These results suggest that the A and B classes of 
ecdysone-inducible RNAs are initiated by different 
promoters, located about 30 kb apart and that the two 
transcription units defined by these promoters overlap in the 
region downstream from the B promoter. 
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This suggestion was confirmed by analysis of the struc- 
ture of cloned cDNAs from the E75A and E75B mRNAs. 
Approximately 10° clones from an early pupal cDNA library 
(Poole, S. J., L. M. Kauvar, B. Drees, and T. Kornberg, 1985. 
The engrailed locus of Drosophila: Structural analysis of an 
embryonic transcript. Cell 40:37-40) were screened at low 
resolution with genomic DNA probes from the E75 gene 
region. The 116 cDNA clones identified by this screen were 
analyzed by restriction digestion and hybridization to a 
panel of probes derived from the 60 kb (+166 to +226 kb) 
region. One of the clones, ÀDm4925, was thereby selected 
as a representative of the E75A class of mRNAs, and 
another, 4Dm4745, as a representative of the E75B mRNA 
class. 

The genomic regions homologous to these two cDNA 
clones were further localized by Southern blot analysis, and 
the nucleotide sequence of these regions and of both cDNA 
clones was determined. These sequences are given in Table 
1, along with those derived from 5' and 3' terminal sequence 
determinations for each transcription unit. These data dem- 
onstrate that the 50 kb E75A transcription unit consists of six 
exons, labeled in 5' to 3' order: AO, A1, 2, 3, 4 and 5, of 
which exons AO and Al are specific to this unit, while the 
remaining four are shared with the 20 kb E75B transcription 
unit. Similarly, the E75B unit contains a specific exon, 
labeled B1, at its 5' end, which is located just upstream of the 
shared exon 2. Thus, the E75 gene consists of two transcrip- 
tion units, of which the shorter E75B unit occupies the 3' 
proximal 20 kb of the longer E75A unit. 

Table 1. Sequences of the E75 exons and flanking DNA. 
The sequence is that of the C' genomic DNA, which was 
identical to that of the cDNAs, except for the T—G change 
indicated at position +2691. This change would convert a 
leucine to an arginine in the protein sequences. The Dm4925 
cDNA extends from just 5' of the EcoRV site at +939 to 
+4267 in A. The Dm4745 cDNA extends from +804 in B to 
a point near the HindIII site at +4246 in A. (A) The E75 A 
exons and flanking DNA. The sequences of the AO, A1, and 
common exons 2-5 are interrupted by intron sequences 
(lowercase), which are limited to those near the splice sites 
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and are in agreement with consensus sequences for donor 
(5') and acceptor (3) splice sites. Numbers at the right end 
of each line refer to the number of base pairs upstream of the 
E75 A initiation site if negative, positive numbers refer to 
positions in the E75 A mRNAs, continuing into the 3' 
flanking DNA. Numbers at the left end of each line refer to 
amino acid residues in the E75 A protein. The underlined 14 
bp sequence at -159 to -172 exhibits a 13/14 bp match to 
asequence (CGTAGCGGGTCTO) found 47 bp upstream of 
the ecdysone-inducible E74 A transcription unit responsible 
for the early puff at 74EF. This sequence represents the 
proximal part of a 19 bp sequence in the E74 A promoter that 
binds the protein encoded by the D. melanoaaster zeste 
gene. Another underlined sequence in the E75 A promoter at 
—74 to -82 is also found in the E75 B promoter, where it is 
part of a tandemly repeated octanucleotide (GAGAGAGC) 
located at -106 to -121 in B. This repeat matches the 
consensus sequence for the binding sites of the GAGA 
transcription factor which also binds to the E74 A promoter. 
Other underlined sequences represent, at -27 to —33, the best 
match to the TATA box consensus at an appropriate position, 
three AUG codons that are closely followed by in-frame stop 
codons in the 5'-leader sequence of the E75 mRNAs, and 
alternative polyadenylation-cleavage signals at 4591 and 
5365 that are used by both E75 A and E75B mRNAs. (B) 
The B1 exon and its 5'-flanking DNA. The numbering at the 
right and left ends of the lines follows the same convention 
as in A. Exons 2-5 shown in A are also used in E75 B, but 
the amino acid residues and base pair numbers shown in A 
must be increased by 157 and 375, respectively, to apply to 
the E75 B protein and mRNA. The first ten nucleotides of 
the 136-nucleotide E75 B-intron linking the B1 exon to 
Exon 2 are gtaggttag, whereas the last ten are shown 
upstream of nucleotide 1178 in A. The underlined sequences 
represent, in order, the region of homology to a sequence 
upstream of E75 A, noted above, the best match to the TATA 
box consensus at -21 to -27, and three AUG codons 
followed by in-frame stop codons in the 5' leader of the E75 
B mRNA. 


ACTTACTAGTGAAAAACATGATAATAAACAACTTGCCAAAAAAAATCCAATGAAATTGACA 


CTTATGTTAAAAAAATAGGTGAGATTGTAACCGTTGATGTACACTTACGAAGTACGTAACAAGTTCATGA 
-141 

ACTGATTTCGTGAGCAGGTCTCTCCATAATCGCCGTATCTGTGGGATCGCGCGCTCCTGCTCGCACTCGC 

TGGGTGGATGGCAGCACATGTTCGAAGTGCGAGAGAGTGCAAAGCGGAGAGCGCCGACGTCGACGCCGAA 


+L 
AAAACTGAACAAGATCCGCCGCGAATGTTGATTTTCCTTTCATTGACTAACTGCCACTCGCAGCGCGCAG 


I. mRNA start site 


ATCGTCGGCTCCGCTTGTTCCGTTCCGTTCGTTTCGTTTCGTTTCGTTCGATCTACTTCGAGTCGCGAGT 

TTTAAGCAGTGTAGTGAGTGCCCCGTGAAAAGGATAACCCAAAAAGTGATTTCTACTATTTTCCAATAGT 
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TTTTATCAGTGTGAAGAAAACATGTAAACTTGGCTCAAAAAGGGCTTTAAAAGATACAAAGCTTCAATGC 


GAAGGATAAAATAATATCGCACCAGTGCTTCAAAAACCAAAACTATGCCTAAGGCTGGAAATTTAAATTA 
AAATTTTTTTAATAAATATTCCAAAAATATTGCCCCTGAAAAGTGTTGATAAACCCCCAACCGAGCAAA 
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GGT 
Gly 


GCT 
Ala 


TTC 
Phe 


GTG 
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CAG 
Gln 
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GTGAGTTGTG..intron 1..TTCTTTGCAG 


1082 


AAC ATA GAA TTC GAT GGC ACC ACA GTG 
Glu Phe Asp Gly Thr 


Asn 


TCC 
Ser 


ccc 
Arg 


AGC 
Ser 


ATT 
Ile 


1352 
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Glu 


CAG 
Gln 


CTG 
Leu 


CGG 
Arg 


CCG 
Pro 


1622 
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Ala 
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GGT 
Gly 


TCC 
Ser 


ATT 
Ile 


GCC 
Ala 


AAG 
Lys 
CGA 
Arg 


CGC 
Arg 


CAG 
Gln 


CTG 
Leu 


CAC 
His 


CTC 


Leu 


TTT 
Phe 


TTC CAT 
Phe His 


ATC CAG 
Ile Gln 


CTG CGC 
Leu Arg 


GTG GGC 
Val Gly 


GCG CGT 
Ala Arg 


GCC CTC 
Ala Leu 


GCC CAC 
Ala His 


CGG GCG 
Arg Ala 


AAC CCC 
Asn Pro 


GTA ATT 
Val Ile 


ACC CAG 
Thr Gln 


GTG CGC 
Val Arg 


TAC 


GGC 


GTG 


Tyr Gly Val 


CAA 
Gln 


ATC 
Ile 


AAG 
Lys 


AAT 
Asn 


ATC 
Ile 


ccc 
Arg 


Thr 


CAT 
His 


CAG 
Gln 


AAT 
Asn 


Val 


TCC 
Ser 


TAT 
Tyr 


CGT 
Arg 


CGA TCT CAC CTG GAA AAT GCC CTG AAA 
Arg Ser His Leu Glu Asn Ala Leu Lys 


TAC TAC CAG CAC AAC AGC AAG CTG GGC 
Tyr Tyr Gln His Asn Ser Lys Leu Gly 


AGG AGC CTG GTA GCA CCT GTC ACA GAT 
Arg Ser Leu Val Ala Pro Val Thr Asp 


ACC ATG GCG AGT TCT TCG AAT TCT ccc 
Thr MET Ala Ser Ser Ser Asn Ser Pro 


CGC GTG ATC TTT GTC AGC AAA TCG AGT 
Gly Val Ile Phe Val Ser Lys Ser Ser 


GCA GTG TTG CAA CAG CAG CAG CCG CAG 
Ala Val Leu Gln Gln Gln Gln Pro Gln 


CTG CCC CAC CAC CAC CCC CAG CAG GAA 
Leu Pro His His His Pro Gln Gln Glu 


CAT CAC CTT CAG CAC CAC CCA CAT CCA 
His His Leu Gln His His Pro His Pro 


CAG GCC AAT CTG CAC CAC TCG GGT GGT 
Gln Ala Asn Leu His His Ser Gly Gly 


CCC CAG ACT CCC GAG TAC ATC AAG TCC 
Pro Gln Thr Pro Glu Tyr Ile Lys Ser 


AGT TCG GTA AAG GGG GAA CCA GAA CTC 
Ser Ser Val Lys Gly Glu Pro Glu Leu 


CTG TGC CGC GTT TGC CGG GAT AAG GCC 
Leu Cys Arg Val Cys Gly Asp Lys Ala 


GTAAGTTCGT..intron 2..ATCGTTTCAG 


TGG GAG GGT TGC AAG GGA TTC TTC CGC 
Cys Glu Gly Cys Lys Gly Phe Phe Arg 
270 


AGA AAG TGC ACC AAG AAT CAG CAG TGC 
Arg Pro Cys Thr Lys Asn Gln Gln Cys 


TGT CAA TAT TGC CGC CTG AAA AAG TGC 
Cys Gln Tyr Cys Arg Leu Lys Lys Cys 


GTGAGTACCT..intron 3..CCAATTGCAG 


ATG AGT GGC GAT GCT GTG CGT TTT GGA CGC GTG CCG AAG CGC 


MET 


ATC 
Ile 


GCC 
Ala 


CTC 
Leu 


CGG 
Arg 


GCC 
Ala 


CGC 
Arg 


GAC 
Asp 


CTG 
Leu 


Ser 


CTG 
Leu 


ACC 
Thr 


GAG 
Glu 


GAT 
Asp 


CCT 
Pro 


GGC 
Gly 


GAT 
Asp 


ATC 
Ile 


Arg 


GCG 
Ala 


GAG 
Glu 


ACC 
Thr 


TGC 
Cys 


GAA 
Glu 


GTG 
Val 


AAG 
Lys 


TGC 
Cys 


Asp 


GCC 
Ala 


CTG 
Leu 


TGT 
Cys 


ccc 
Pro 


CTG 
Leu 


ATC 
Ile 


TIC 
Phe 


ATG 
MET 


Ala 


ATG 
MET 


GAT 
Asp 


GAG 
Glu 


TCC 
Ser 


CAA 
Gln 


GAC 
Asp 


ACG 
Thr 


TTT 
Phe 


Val Arg Phe Gly Arg Val Pro Lys Arg 
324 


CAA CAG AGC ACC CAG AAT CGC GGC CAG 
Gln Gln Ser Thr Gln Asn Arg Gly Gln 


GAC CAG CCA AGA CTC CTC GCC GCC GTG 
Asp Gln Pro Arg Leu Leu Ala Ala Val 


TTC ACC AAG GAG AAG GTC TCG GCG ATG 
Phe Thr Lys Glu Lys Val Ser Ala MET 
378 


GTAAGTCTCA..intron 4..ATTTCTTCAG 


TAC TCC ATG CCC ACA CTT CTG GCC TGT 
Tyr Ser MET Pro Thr Leu Leu Ala Cys 


TCG GAG CAG GAG TTC TCG CAG CGT TTC 
Ser Glu Gln Glu Phe Ser Gln Arg Phe 


TTT GCC GGC ATG ATT CCC GGC TTC CAG 
Phe Ala Gly MET Ile Pro Gly Phe Gln 
432 


CTC CTG AAG GCG GGA CTC TTC GAC GCC 
Leu Leu Lys Ala Gly Leu Phe Asp Ala 


GAC TCG TCG ATA AAC TCA ATC ATC TGT 
Asp Ser Ser Ile Asn Ser Ile Ile Cys 


28 


CTA AAT 
Leu Asn 


TTC CTG 
Phe Leu 


892 
ACA GAT 
Thr Asp 


CCC GGT 
Pro Gly 


TGC CTG 
Cys Leu 


AAG TTG 
Lys Leu 


AAA CTG 
Lys Leu 


2162 
TCC ATG 
Ser MET 


AGC TGG 
Ser Trp 


TCG AGC 
Ser Ser 


CAG GGC 
Gln Gly 


TCG GCT 
Ser Ala 


2432 
GCC AAT 
Ala Asn 


TCG CAC 
Ser His 


GCA 
Ala 


CAG 
Gln 


TTG 
Leu 


CAC 
His 


TCG 
Ser 


GGA 
Gly 


AGC 
Ser 


GTG 
Val 


CGA 
Arg 


CTG 
Leu 


CGG 
Arg 


ACT 
Thr 


AAC 
Asn 


GCC 
Ala 


CAG 
Gln 


GAC 
Asp 


GAG 
Glu 


CGC 
Arg 


TAC 
Tyr 


GAG 
Glu 


GTT 
Val 


GAC 
Asp 


GAT 
Asp 


GAG 
Glu 


TCT 
Ser 


CTG 
Leu 


GGC 
Gly 


CAT 
His 


CAA 
Gln 


AAT 
Asn 


CAC 
His 


GGC 
Gly 


AGT 
Ser 


GAT 
Asp 


TCC 
Ser 


CAG 
Gln 


TAC 
Tyr 


GCC 
Ala 


TCC 
Ser 


AGT 
Ser 


CAG 
Gln 


GTG 
Val 


TCC 
Ser 


CTC 
Ile 


AAC 
Asn 


ATT 
Ile 


ACG 
Thr 


TTC 
Phe 


GGC 
Gly 


GCC 
Ala 


TCC 
Ser 


CTG 
Leu 


CTG 
Leu 


TCC 
Ser 


CTC 
Leu 


CAA 
Gln 


GGG 
Gly 


TTG 
Leu 


ATT 
Ile 


TCC 
Ser 


GCC 
Ala 


GTG 
Val 


ATT 
Ile 


GAT 
Asp 


CTG 
Leu 


GGC 
Gly 


GTG 
Val 


AGC 
Ser 


ATG 
MET 


ACC 
Thr 


GGC 
Gly 


CTG 
Leu 


GTC 
Val 


ATG 
MET 


ccc 
Arg 


AAC 
Asn 


ATG 
MET 


GCC 
Ala 


ccc 
Pro 


GCG 
Ala 


AGC 
Ser 


ACC 
Thr 


CAG 
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GGA 
Gly 


ACA 
Thr 


GAG 
Glu 


TCG 
Ser 


GCC 
Ala 


TCA 
Ser 


GTG 
Val 


ACC 
Thr 


CGG 
Arg 


TCG 
Ser 


GCC 
Ala 


CAG 
Gln 
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CGA 
Arg 
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Phe 


CTG 
Leu 


GAG 
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Ala 
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Pro 
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GAC 
Asp 
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GAT 
Asp 


TTC 
Phe 


TGC 
Cys 


ATC 
Ile 


AAT 
Asn 


CTG 
Leu 


CAC 
His 


GAT 
Asp 


GAG 
Glu 


GAC 
Asp 


CCT 
Pro 


CTC 
Leu 


TCC 
Ser 


GGG 
Gly 


GCG 
Ala 


GCG 
Ala 


GCC 
Ala 


GAG 
Glu 


AGG 
Arg 


CGC 
Arg 


AAG 
Lys 


GGC 
Gly 


GCG 
Ala 


TAC 
Tyr 


CAG 


Gln 


TCC 
Ser 


GGA 
Gly 


CTG 
Leu 


ATC 
Ile 


GAG 
Glu 


ATC 
Ile 


AAG 
Lys 


ccc 
Pro 


ACC 
Thr 


GAG 
Glu 


CAG 
Gln 


GCC 
Ala 


GGC 
Gly 


CAA 
Gln 


GGA 
Gly 


GCA 
Ala 


ACA 
Thr 


CAG 
Gln 


CGC 
Arg 


GTT 
Val 


CTG 
MET 


GAT 
Asp 


CTG 
Leu 


CTG 
Leu 


CAG 
Gln 


AAG 
Lys 


AGT 
Ser 


CAG 
Gln 


GGA 
Gly 


GCT 
Ala 


ATC 
Ile 


AAC 
Asn 


ATG 
MET 


CTG 
Leu 


TAC 
Tyr 


CAG 
Gln 


AGC 
Ser 


CTG 
Leu 


AAC 
Asn 


AGT 
Ser 


CCG 
Pro 


ccc 
Pro 


TGT 
Cys 


GAG 
Glu 


ACG 
Thr 


GGA 
Gly 


AAC 
Asn 


ATT 
Ile 


TCG 
Ser 


CCC 
Pro 


ACC 
Thr 


CGC 
Arg 


AAG 
Lys 
CCG 
Pro 


AGC 
Ser 


TCG 
Ser 


ccc 
Pro 


ATG 
MET 


CCG 
Pro 


GCC 
Ala 


TCG 
Ser 


ACG 
Thr 


CGA 
Arg 


GAG 
Glu 


CTG 
Leu 


CAG 
Gln 


TCG 
Ser 


CTT 
Leu 


AGT 
Ser 


GCT 
Ala 


CTG 
Leu 


GAT 
Asp 


ATT 
Ile 


GTAGTATCTT..intron 5..TTTCTTACAG 


CAG 
Gln 


CAC 
His 


GEC 
Ala 


TCG 
Ser 


TGC 
Cys 


GCC 
Ala 


CCA 
Pro 


GAG 
Glu 


AAC 
Asn 


CAT 
His 
AAC 
Asn 


ACG 
Thr 


CTG 
Leu 


CAG 
Gln 


GCG 
Ala 


GGA 
Gly 


GGC 
Gly 


TCC 
Ser 


AAT 
Asn 


GTT 
Val 


GAT 
Asp 


TCG 
Ser 


ccc 
Arg 


AGC 
Ser 


ccc 
Pro 


CAC 
His 


CAG ATC GGA 


Gln 


ATG 
Met 


GCT 
Ala 


AAC 
Asn 


AGT 
Ser 


CAC 
His 
ccc 
Arg 


ATG 
MET 


CTG 
Leu 


GAG 
Glu 


CTG 
Leu 


CCG 
Pro 


ATG 
MET 


Ile 


TGC 
Cys 


GCC 
Ala 


GAG 
Glu 


CCG 
Pro 


AAT 
Asn 


TCG 
Ser 


ccc 
Pro 


ATG 
MET 


TTC 
Phe 


AGT 
Ser 


CCA 
Pro 


CAC 
His 


Gly 


CAG 
Gln 


ccc 


ATA 
Ile 


CAA 
Gln 


TAC 
Tyr 


AAC 
Asn 


TCC 
Ser 


GTG 
Val 


CAG 
Gln 


CTG 
Leu 


GAG 
Glu 


ACC 
Thr 


GGC 
Gly 


GCG 
Ala 


ACC 
Thr 


CTC 
Leu 


CAG 
Gln 


AGA 
Arg 


GAG 
Glu 


AGT 
Ser 


GTG 
Val 


CCC 
Pro 


AAG 
Lys 


GCC 
Ala 


GCC 
Ala 


AGT 
Ser 


GCC 
Ala 


CGC 
Arg 


AAT 
Asn 


CAG 
Gln 


AAG 
Lys 


TGC 
Cys 


GTG 
Val 


CAG 
Gln 


TCC 
Ser 


CGC 
Arg 


TAC 
Tyr 


GAG 
Glu 


CCG 
Pro 


AGC 
Ser 


AGC 
Ser 


AAT 
Asn 


CAG 
Gln 


GTG 
Val 


GCC 
Ala 


AAC 
Asn 


GAT 
Asp 


AAG 
Lys 


CTG 
Leu 


ACC 
Thr 


ATG 
MET 


TCG 
Ser 


TCG 
Ser 


CAG 
Gln 


GCC 
Ala 


AAC 
Asn 


GTT 
Val 


CGA 
Arg 


CAT 
His 
CCA 
Pro 


G(Dm4925) 
CTA GAT TCG 
Leu Asp Ser 


Arg 


AAG 
Lys 


GAT 
Asp 


CAT 
His 
ACC 
Thr 


GTG 
Val 


AAG 
Lys 


GCG 
Ala 


CGG 
Arg 
GCC 
Ala 


AGC 
Ser 


Lys 


ccc 
Arg 
486 
CTG 
Leu 


ccc 
Arg 
GGC 
Gly 
540 


GCC 
Ala 


GAG 
Glu 


TGG 
Trp 
594 


GGC 
Gly 


GTA 
Val 


CCA 
Pro 
648 
AGC 
Ser 


CGG 
Arg 


GGC 
Gly 
702 
CAC 
His 


TCC 
Ser 


CAA 
Gln 
756 


ccc 
Pro 


AGT 
Ser 


CTG 
Leu 
810 
CTG 
Leu 


CAT 
His 
GCT 


Ala 
864 


AAG 
Lys 
AGC 
Ser 


CCA 
Pro 
918 


GGT 
Gly 


GCC 
Ala 


30 
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3242 
TCG ATG GCC AGC TCG CAC TCG GTG CTG GCC AAG TCT CTC ATG GCC GAG CCG CGC 
Ser MET Ala Ser Ser His Ser Val Leu Ala Lys Ser Leu MET Ala Glu Pro Arg 


ATG ACG CCC GAG CAG ATG AAG CGC AGC GAT ATT ATC CAA AAC TAC TTG AAG CGC 
MET Thr Pro Glu Gln MET Lys Arg Ser Asp Ile Ile Gln Asn Tyr Leu Lys Arg 


GAG AAC AGC ACA GCA GCC AGC AGC ACC ACC AAT GGC GTG GGC AAC CGC AGT CCC 
Glu Asn Ser Thr Ala Ala Ser Ser Thr Thr Asn Gly Val Gly Asn Arg Ser Pro 


AGC AGC AGC TCC ACA CCG CCG CCG TCG GCG GTC CAG AAT CAG CAG CGT TGG GGC 
Ser Ser Ser Ser Thr Pro Pro Pro Ser Ala Val Gln Asn Gln Gln Arg Trp Gly 


AGC AGC TCG GTG ATC ACC ACC ACC TGC CAG CAG CGC CAG CAG TCC GTG TCG CCG 
Ser Ser SeR Val Ile Thr Thr Thr Cys Gln Gln Arg Gln Gln Ser Val Ser Pro 


3512 
CAC AGC AAC GGT TCC AGC TCC AGT TCG AGC TCT AGC TCC AGC TCC AGT TCG TCA 
His Ser Asn Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser 


TCC TCC TCC ACA TCC TCC AAC TGC AGC TCC AGC TCG GCC AGC AGC TGC CAG TAT 
Ser Ser Ser Thr Ser Ser Asn Cys Ser Ser Ser Ser Ala Ser Ser Cys Gln Tyr 


TTC CAG TCG CCG CAC TCC ACC AGC AAC GGC ACC AGT GCA CCG GCG AGC TCC AGT 
Phe Gln Ser Pro His Ser Thr Ser Asn Gly Thr Ser Ala Pro Ala Ser Ser Ser 


TCG GGA TCG AAC AGC GCC ACG CCC CTG CTG GAA CTG CAG GTG GAC ATT GCT GAC 
Ser Gly Ser Asn Ser Ala Thr Pro Leu Leu Glu Leu Gln Val Asp Ile Ala Asp 


TCG GCG CAG CCT CTC AAT TTG TCC AAG AAA TCG CCC ACG CCG CCG CCC AGC AAG 
Ser Ala Gln Pro Leu Asn Leu Ser Lys Lys Ser Pro Thr Pro Pro Pro Ser Lys 


3782 
CTG CAC GCT CTG GTG GCC GCC GCC AAT GCC GTT CAA AGG TAT CCC ACA TTG TCC 
Leu His Ala Leu Val Ala Ala Ala Asn Ala Val Gln Arg Tyr Pro Thr Leu Ser 


GCC GAC GTC ACA GTG ACA GCC TCC AAT GGC GGG TCC TCC GTC GGC GGC GGC GAG 
Ala Asp Val Thr Val Thr Ala Ser Asn Gly Gly Ser Ser Val Gly Gly Gly Glu 


TCC GGC CGC CAG CAG CAG TCC GCC GGC GAG TGT GGG CTC CCC CAA TCC GGG CCT 
Ser Gly Arg Gln Gln Gln Ser Ala Gly Glu Cys Gly Leu Pro Gln Ser Gly Pro 


GAG CGC CGC CGT GCA CAA GGT AAT GCT GGA GGC GTA AGA GCG GGA GGA GGT AGG 
Glu Arg Arg Arg Ala Gln Gly Asn Ala Gly Gly Val Arg Ala Gly Gly Gly Arg 


TGG TTT TAC GCG GAG AAG TGG GAG AGA CAG AGA CTG GGA GTG GCA GTT CAG CGA 
Trp Phe Tyr Ala Glu Lys Trp Glu Arg Gln Arg Leu Gly Val Ala Val Gln Arg 


4052 

AGC AGG AAG CAG GAT CAC TTG GAG CGG CGG GAG TTG AAT TAA 

Ser Arg Lys Gln Asp His Leu Glu Arg Arg Glu Leu Asn 
1237 


ATTATTTTACCATTTAATTGAGACGTGTACAAAGTTTGAAAGCAAAACCAACATGCATGCAATTTAAAAC 
TAATATTTAAAGCAACAACAAACAAAACAACTACAAGTTATTAATTTAAAAAACAAACAAACAAACAAAC 


4234 
AACAAAAAACCCAAGCTTGAATGGTATTACAAAAGAAAAAGAAAAACAGAAAAAATATAAATATATTTTA 


GCAGTTAAACTTTAACGTAGCAAGAAACCAACAAACCCAAGGCAGCGCTCTGATTTCGCATTAACTTTTC 


4374 
TTCAGCTGCTACCGAAAACGCCCCTCACCTCCCCCCCACCCAACCCTTCCTCCACACACCAACCGTCTTT 


CGACCCCTGATTGTTTTATAAGTTTTAAGCTCTTGTTGTACATATTAATTACGTTTATTGGTAACTATGT 


4514 
TTAGCGCTTTAGTTGTAGTTGGAGCAAAACTACTTTGCTTTTTTGGATGTTTTTTGAAAAAACTGCAAAT 


TATTATTATTAAATTTTTAAATACCTAAAAACAAAACAATGTGTGTGAAATTTTTTATTGTGCGATCTCC 


b poly A site cDm4927 and cDm4928 
4654 
AAGCAGAATGAAGTGCAGTTTGCAACAAATTTTAACTACGATTAAGTTGATAACGATTCATTTTTTATGA 
ATTTAACTAATTTTATGAATTTGTTATAGTTTTCCACCCTTCTATAGATCTTCTATCTGATCATCTAGCT 


4794 
ACCCGTATTCCTGATTTCTCCTTTGGCACAAAGCTCTTCTCTATGCTAAAGAATCAAGTGGAATAAATAT 


TGTTTTCTAATTTTAAAACTACCACAAAAATACGATTAAAATATACACGAAGTAATGAAAATCAAACAAA 


4934 
ATGCTTAAAGTTTTAGCAGCAAGCAGTAAAACGACGATGAAGAAGAGAAACCCAACGTTAAATATATCTG 


TTGTGTACATAGTTAAATGTTAAATTAAACACAAAAACATATTTAAAGTACATATAAATACACATAATTA 


5074 
TTAATGAAGAAACCTATGCTTAAAAGATTCAATGTTTGATTGGCATCTTAGAAAACCAAGCGAAAAATAC 


AAAAAAAAATCAACAAACAAAAATTATGATATATTATTTAAAAGTAAAGTATACATTTACATTACAGAAA 


5214 
AACAAAAGAGAAAACTTGCGGTAGCAACAAAACTATTATATTAATTACATTTTAATTATGCTGTACTATT 


32 
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-continued 
ATGATTATTAATTATTATGATTAATTAATTACGATTTTTATGCTTAGACAAACCAACAAAAAACAAATAT 


5354 
GCAAAAACCATTAAAAAAAAAAACAAAAAACAAGCAAAAAAT 


L , putative polyadenylation signal for long transcripts 


B 
CGACGCGTTTGGAGTGAACGTCC'TCAGTTGGCACACAAAAACAAAAACACAAAACGACAGCAACAACATC 


-141 
GGTGGGGGGAGTACGAGCGGGATGGGGGTAATGGGGGGCACCGGGGGAGTGGAGGCCGAGAGAGCGAGAG 


AGCGACCCGAAGCAACACAACACCAACACGAGGCCCAAAAAGACACTTCGGCTGGGTTCAGCTCGTGTTG 


*1 
CTCTGGGTCGTTTTGTATTGCTGGTGGACGCTGCTTTCATTCGCAAATTGCTCGTCGTTGGCAGCGGTTG 


L» mRNA start site 


TGCAGAGCAAGAAAAGCGCGCGAAAAACCAAGCAAAAAATTAATACAGCTGGATCAAGCGAAAGAGATAG 
AGAGCAGAGTCAACAGCAACAAATGTTCAATAGCAAATGATATCGCATATTTTTGTTGGTGCCAGTGAAG 


*211 
TGAGATCAAAGTGAAGTGTGCAATGTTCCTTATTAGCAAATCGTAGAGCAACCAACAATCGAGAGTTCAA 


284 

GTGTCATTTCGAAGCCAAAAAGCAAAATCTCTAATTCAAAAT ATG GTT TGT GCA ATG CAA 
MET Val Cys Ala Met Gln 
1 


GAG GTT GCT GCT GTG CAG CAT CAG CAG CAG CAA CAG CAA CTC CAG TTG CCC CAG 
Glu Val Ala Ala Val Gln His Gln Gln Gln Gln Gln Gln Leu Gln Leu Pro Gln 
24 


CAG CAA CAG CAG CAG CAG CAG ACA ACA CAG CAG CAA CAT GCA ACA ACG ATA GTG 
Gln Gln Gln Gln Gln Gln Gln Thr Thr Gln Gln Gln His Ala Thr Thr Ile Val 


CTG CTG ACG GGC AAT GGC GGC GGT AAT CTG CAC ATT GTC GCC ACA CCG CAA CAG 
Leu Leu Thr Gly Asn Gly Gly Gly Asn Leu His Ile Val Ala Thr Pro Gln Gln 


CAT CAG CCG ATG CAT CAG CTC CAC CAT CAG CAT CAG CAT CAG CAT CAG CAG CAG 
His Gln Pro MET His Gln Leu His His Gln His Gln His Gln His Gln His Gln 
78 


CAG CAG GCC AAG AGC CAA CAG CTG AAG CAA CAA CAC TCG GCG CTG GTC AAG TTG 
Gln Gln Ala Lys Ser Gln Gln Leu Lys Gln Gln His Ser Ala Leu Val Lys Leu 


CTG GAG TCG GCG CCC ATC AAG CAG CAA CAG CAG ACG CCC AAG CAA ATT GTT TAC 
Leu Glu Ser Ala Pro Ile Lys Gln Gln Gln Gln Thr Pro Lys Gln Ile Val Tyr 


CTG CAG CAG CAG CAG CAG CAA CCG CAA CGC AAA AGA CTG AAA AAC GAA GCA GCA 
Leu Gln Gln Gln Gln Gln Gln Pro Gln Arg Lys Arg Leu Lys Asn Glu Ala Ala 
132 


ATC GTA CAA CAG CAA CAA CAA ACA CCT GCA ACA CTA GTA AAG ACA ACA ACC ACC 
Ile Val Gln Gln Gln Gln Gln Thr Pro Ala Thr Leu Val Lys Thr Thr thr Thr 


AGC AAC AGC AAC AGC AAC AAC ACC CAG ACA ACA AAT AGT ATT AGT CAG CAG CAA 
Ser Asn Ser Asn Ser Asn Asn Thr Gln Thr Thr Asn Ser Ile Ser Gln Gln Gln 


CAG CAG CAT CAG ATT GTG TTG CAG CAG CAG CAG CCA GCC GCG GCA GCA ACA CCA 
Gln Gln His Gln Ile Val Leu Gln His Gln Gln Pro Ala Ala Ala Ala Thr Pro 
186 


AAG CCA TGT GCC GAT CTG AGC GCC AAA AAT GAC AGC GAG TCG GGC ATC GAC GAG 
Lys Pro Cys Ala Asp Leu Ser Ala Lys Asn Asp Ser Glu Ser Gly Ile Asp Glu 


GAC TGC CCC AAC AGC GAT GAG GAT TGC CCC AAT GCC AAC CCG GCG GGC ACA TCG 
Asp Cys Pro Asn Ser Asp Glu Asp Cys Pro Asn Ala Asn Pro Ala Gly Thr Ser 


CTC GAG GAC AGC AGC TAC GAG CAG TAT CAG TGC CCC TGG AAG AAG ATA CGC TAT 
Leu Glu Asp Ser Ser Tyr Glu Gln Tyr Gln Cys Pro Trp Lys Lys Ile Arg Tyr 
240 


GCG CGT GAG CTC CTC AAG CAG CGC GAG TTG GAG CAG CAG CAG ACC ACC GGA GGC 
Ala Arg Glu Leu Leu Lys Gln Arg Glu Leu Glu Gln Gln Gln Thr Thr Gly Gly 


AGC AAC GCG CAG CAG CAA GTC GAG GCG AAG CCA GCT GCA ATA CCC ACC AGC AAC 
Ser Asn Ala Gln Gln Gln Val Glu Ala Lys Pro Ala Ala Ile Pro Thr Ser Asn 


ATC AAG CAG CTG CAC TGT GAT AGT CCC TTT TCG GCG CAG ACC CAC AAG GAA ATC 
Ile Lys Gln Leu His Cys Asp Ser Pro Phe Ser Ala Gln Thr His Lys Glu Ile 
294 


GCC AAT CTC CTG CGC CAA CAG TCC CAG CAA CAA CAG GTT GTG GCC ACG CAG CAG 
Ala Asn Leu Leu Arg Gln Gln Ser Gln Gln Gln Gln Val Val Ala Thr Gln Gln 


CAG CAG CAA CAG CAG CAG CAG CAC CAG CAC CAG CAA CAA CGA AGG GAT AGC TCC 
Gln Gln Gln Gln Gln Gln Gln His Gln His Gln Gln Gln Arg Arg Asp Ser Ser 


GAC AGC AAC TGC TCG CTG ATG AGC AAC TCG TGC AAC TCC AGT GCG GGC AAT TGT 
Asp Ser Asn Cys Ser Leu MET Ser Asn Ser Ser Asn Ser Ser Ala Gly Asn Cys 
348 


34 
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-continued 
CAG CAG CTG 
Gln Gln Leu 


TGC ACC 
Cys Thr 
1382 
GAT 
Asp 
TCC 
Ser 


AGC 
Cys 


AAC 
Asn 


GCT 
Ala 


GGC 
Gly 


GAC 
Asp 


GAC 
Asp 


GAG 
Glu 


TCG 
Ser 


CAA 
Gln 


GGC 
Gly 


CTG 
Leu 


TGC 
Cys 


AAT 
Asn 


GAC 
Asp 


TAC 
Tyr 


GAT 
Asp 


CTG 
Leu 


GAA 
Glu 


TGC 
Cys 


CTT 
Leu 


CAG 
Gln 


TGC 
Cys 


AAG 
Lys 


GAG 
Glu 


TTC 
Phe 


CAG 
Gln 


GAT 
Asp 


CAT 
His 
GAG 
Glu 


AGC 
Ser 


1544 


GAT GCC 
Asp Ala 


AAC 
Asn 


AGC 
Ser 


AGC 
Ser 


GCC 
Ala 


AAC 
Asn 


ACG 
Thr 


GGG 
Gly 


AGG 
Arg 


AAC 
Asn 


ACG 
Thr 


CCA 
Pro 


GAT 
Asp 


gtaggtttag 


Methods 

cDNA Libraries 

The X.Dm4925 and }Dm4745 cDNAs were isolated from 
an O" early pupal cDNA library in Agt10 (Poole, S. J., L. M. 
Kauvar, B. Drees, and T. Kornberg, 1985. The engrailed 
locus of Drosophila: structural analysis of an embryonic 
transcript. Cell 40:37-40). The two cDNAs (X.Dm4927 and 
).Dm4928) that were used for 3'-end mapping were isolated 
from an ecdysone-induced salivary gland cDNA library in 
).607 prepared by C. W. Jones. (Our strain collection names 
for the cDNA clones used in these studies are AfDm4925, 
).f£Dm4745, XeDm4927, and XeDm4928.) 

Northern Blot Analysis 

Probes to be used for Northern blots were cloned into the 
vector ppX (from R. Mulligan), containing the $X174 origin 
of replication cloned in between the HindIII and BamHI 
sites of pBR322. This allowed the synthesis of single- 
stranded probe DNA (Arai, K., N. Arai, J. Schlomai, and A. 
Kornberg, 1980. Replication of duplex DNA of phage 
$X174 reconstituted with purified enzymes. Proc. Natl. 
Acad. Sci. 77:3322-3326), which was performed by the 
incubation of supercoiled plasmid DNA with gene A Protein, 
rep and ssb proteins, and DNA polymerase III holoenzyme 
(all generously provided by the A. Kornberg laboratory) in 
a reaction containing 20 mM Tris CI (pH 7.5), 80 ug/ml 
BSA, 4% glycerol, 20 mM DTT, 1 mM ATP, 16 mM 
concentrations of the three unlabeled deoxynucleotides and 
1.6 mM concentrations of the labeled deoxynucleotide for 1 
hour at 30? C. EDTA was then added to 20 mM, SDS to 
0.1%, and proteinase K to 50 ug/ml. The reactions were 
digested for 30 minutes at 37? C., and unincorporated label 
was removed by gel filtration. 

S1 Nuclease Protection and Primer Extension Analysis 

Single-stranded probes, prepared as described above by 
the $X in vitro replication system, were purified by electro- 
phoresis on low melting point agarose gels for use as S1 
probes. All other probes were prepared by extension of the 
-20, 17-mer sequencing primer (New England Biolabs) on 
single-stranded M13mp (Messing, J., 1983. New M13 vec- 
tors for cloning. Methods Enzymol. 101:20-78) or pEMBL 
(Dente, L., G. Cesareni, and R. Cortes, 1983. pEMBL: A 
new family of single-stranded plasmids. Nucleic Acids Res. 
11:1645-1654) recombinant templates using **P-labeled 
nucleotides, followed by cleavage with the appropriate 
restriction enzyme and purification of the probe on denatur- 
ing polyacrylamide gels. Labeled probe (100,000—300,000 
cpm) was incubated with 1 ug of poly(A)+ RNA in a 5 ul 
reaction mixture containing 5 ug of yeast tRNA, 0.4 M 
NaCl, 40 mM PIPES (pH 6.8), and 1 mM EDTA at 60? C. 
under oil. Reactions were cooled and diluted 1:10 into either 
S1 digestion or primer extension buffer. S1 nuclease diges- 
tions were performed in 50 mM acetate buffer (Na), 400 mM 
NaCl, and 4 mM ZnSO, at 20? C. for 1 hour with -15-150 


ATG 
MET 


CAG 
Gln 


CTG 
Leu 


GTA 
Val 
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GAC GAG 
Asp Glu 


GCC 
Ala 


CAC 
His 


CGA CTG 


Arg Leu 


GAC ACG 
Asp Thr 


GAC 
Asp 


GCG 
Ala 


TCC 
Ser 
CTG 
Leu 
402 
GAA 
Glu 


ACA GCT 
Thr Ala 


AAC 
Asn 


Vogt units of S1 nuclease (Boehringer) per 50 ul reaction. 
Primer extensions were performed at 42° C. in 50 mM Tris 
Cl (pH 8.3 at 42? C.), 80 mM KCl, 2 mM DTT, 1 mM of 
dATP, dCTP, dGTP, and dTTP, with 20 units of AMV reverse 
transcriptase (Seikagaku) per 50 ul reaction. Reactions were 
terminated by the addition of EDTA, tRNA carrier was 
added to the S1 nuclease digestions, and samples were 
ethanol-precipitated and either electrophoresed directly on 
5% or 6% denaturing polyacrylamide gels or glyoxalated 
(McMaster, G. K., and G. C. Carmichael, 1977. Analysis of 
single and double-stranded nucleic acids on polyacrylamide 
and agarose gels by using glyoxal and acridine orange. Proc. 
Natl. Acad. Sci. 74:4835—4838) and electrophoresed on 1% 
agarose gels run in 10 mM sodium phosphate buffer (pH 
6.8). 
DNA Sequence Analysis 

The cDNA clones }Dm4927 and XDm4928 were 
sequenced by chemical degradation (Maxam, A. M., and W. 
Gilbert, 1980. Sequencing end-labeled DNA with base- 
specific chemical cleavage. Methods Enzymol. 65:499-560). 
All other sequencing was performed using the dideoxy- 
nucleotide chain termination method (Sanger, E, A. R. 
Coulson, B. F. Barrell, A. J. H. Smith, and B. A. Roe, 1980. 
Cloning in single-stranded bacteriophage as an aid to rapid 
DNA sequencing. J. Mol. Biol. 143:161-178). Fragments 
were cloned into M13mp (Messing, J., 1983. New M13 
vectors for cloning. Methods Enzymol. 101:20—78) or 
pEMBL (Dente, L., G. Cesareni, and R. Cortes, 1983. 
pEMBL: A new family of single-stranded plasmids. Nucleic 
Acids Res. 11:1645-1654) vectors and sequenced directly or 
following the generation of a set of overlapping deletions 
using exonuclease III (Henikoff, S., 1984. Unidirectional 
digestion with exonuclease III creates targeted breakpoints 
for DNA sequencing. Gene 28:351-359). Sequencing was 
performed on both strands of the ADm4925 cDNA, the 
B-specific region of 4Dm4745 cDNA, the A- and B-specific 
5' genomic regions not represented in the cDNAs, and the 
3'-flanking region. The remaining exon boundaries of 
XDm4745 and genomic regions represented within the 
cDNA clones were sequenced on one strand. 
D. The E75 Gene Encodes Two Members of the Steroid 
Receptor Superfamily 

The coding and noncoding sequences of the E75 A and B 
mRNAs, their splice junctions, and the 5' and 3' flanking 
sequences are shown in Table 1. Certain sequences of 
potential interest within the 5' flanking DNA and in the 5' 
leader mRNA sequences are indicated in the legend to Table 
1. We focus here on the large open reading frames of the E75 
A and B mRNAs that begin at 380 bp and 284 bp down- 
stream from their respective mRNA start sites, each con- 
tinuing into the common final exon. The termination codon 
in exon 5 lies upstream of both alternative polyadenylation 
sites; thus, the sequence of the encoded protein is not 
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affected by which site is selected. Since the open reading 
frames in the E75 A and B mRNAs begin in the AO and B1 
exons and merge at the beginning of exon 2, the proteins 
encoded by the two transcription units differ in the amino- 
terminal region and are the same in the carboxy-terminal 
region. The specific amino-terminal regions contain 266 and 
423 amino acid residues in the E75 A and B proteins, 
respectively, while their common carboxy-terminal region 
consists of 971 residues. The predicted molecular weights of 
the A and B proteins are thus 132,000 and 151,000. The open 
reading frames display characteristic D. melanogaster codon 
usage, and their extents have been confirmed by in vitro 
translation of mRNAs transcribed in vitro from cDNA 
constructs and by expression of fusion proteins in E. coli. 
The predicted protein sequence for each protein is punctu- 
ated by homopolymeric tracts of amino acids which are 
noted in Table 1 and its legend. 

Analysis of the sequences of E75 proteins and comparison 
to the sequences of known proteins have revealed similarity 
between the E75 proteins and members of the steroid 
receptor superfamily (Evans, R. M., 1988. The steroid and 
thyroid hormone receptor superfamily. Science 
240:889-895; Green, S., and P. Chambon, 1988. Nuclear 
receptors enhance our understanding of transcription regu- 
lation. Trends in Genetics 4:309-314). We have used the 
nomenclature of Krust el al. (Krust, A., S. Green, P. Argos, 
V. Kumar, P. Walter, J. Bornert, and P. Chambon, 1986. The 
chicken oestrogen receptor sequence: Homology with 
v-erbA and the human oestrogen and glucocorticoid recep- 
tors. EMBO J. 5:891-897) in dividing the proteins into six 
regions, letters A-F, in the amino- to carboxy-terminal 
direction. 

Similarity between E75A and other members of this 
superfamily is strongest in the C region, a cysteine-lysine- 
arginine-rich region that is necessary and sufficient for the 
binding of these receptors to DNA (for review, see, Evans, 
R. M., 1988. The steroid and thyroid hormone receptor 
superfamily. Science 240:889-895; Green, S., and P. 
Chambon, 1988. Nuclear receptors enhance our understand- 
ing of transcription regulation. Trends in Genetics 
4:309-314). The C region consists of 66—68 amino acids, of 
which 20 residues are invariant within this family. Among 
these are nine invariant cysteine residues, eight of which are 
believed to coordinate zinc in the formation of two zinc 
finger-like structures (Miller, J., A. D. McLachlan, and A. 
Klug, 1985. Representative zinc-binding domains in the 
protein transcription factor INA from Xenopus oocytes. 
EMBO J. 4:1609-1614; Freedman, L. P., B. F. Luisi, Z. R. 
Korszun, R. Basavappa, P. B. Sigler, and K. R. Yamamoto, 
1988. The function and structure of the metal coordination 
sites within the glucocorticoid receptor DNA binding 
domain. Nature 334:543-546; Severne, Y., S. Wieland, W. 
Schaffner, and S. Rusconi, 1988. Metal binding finger struc- 
ture of the glucocorticoid receptor defined by site-directed 
mutagenesis. EMBO J. 9:2503-2508). Within the C region, 
E75A contains all of the highly conserved residues and is 
approximately as closely related to other members of the 
steroid receptor superfamily as they are to one another. The 
closest relative of E75 appears to be the human ear-1 gene, 
which has nearly 80% amino acid identity to E75 A in the 
DNA-binding domain. 

The other region conserved among members of the steroid 
receptor superfamily is the E region, which is required for 
steroid binding and for the linkage of steroid-binding and 
trans-activation functions (for review, see, Evans, R. M., 
1988. The steroid and thyroid hormone receptor superfam- 
ily. Science 240:889—895; Green, S., and P. Chambon, 1988. 
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Nuclear receptors enhance our understanding of transcrip- 
tion regulation. Trends in Genetics 4:309—314). Although 
overall E-region similarity is clearly significant for the 
comparison of E75 A to the thyroid hormone, vitamin D, and 
retinoic acid receptors, and ear-1, similarity to the gluco- 
corticoid and estrogen receptors is considerably lower. 
However, the plots of local similarities show a clear simi- 
larity to each of these proteins within three subregions of the 
E region, with we call E1, E2 and E3. The E1 subregion is 
the most highly conserved and corresponds to a region 
shown by in vitro mutagenesis to be essential for steroid 
binding and steroid-dependent trans-activation (Giguere, V., 
S. M. Hollenberg, M. G. Rosenfield, and R. M. Evans, 1986. 
Functional domains of the human glucocorticoid receptor. 
Cell 46:645—652; Danielson, M., J. P. Northrop, J. Jonklaas, 
and G. M. Ringold, 1987. Domains of the glucocorticoid 
receptor involved in specific and nonspecific deoxyribo- 
nucleic acid binding, hormone activation and transcriptional 
enhancement. Mol. Endocrinol. 1:816-822). Region E2 is 
less highly conserved in primary amino acid sequence but 
can, in part, be seen as a conserved hydrophobic region in 
the hydropathy plots of several of these proteins. A deletion 
of 14 amino acids within this region abolished steroid 
binding (Rusconi, S., and K. R. Yamamoto, 1987. Functional 
dissection of the hormone and DNA binding activities of the 
glucocorticoid receptor. EMBO J. 6:1309-1315). E3 falls 
close to the end of the region that is absolutely required for 
steroid binding. 

While the characteristic structural features of the steroid 
receptor superfamily are well conserved in E75, two novel 
variations are seen. The first of these concerns the structure 
of the E75 B protein, which contains a major alteration 
within its putative DNA-binding domain. The steroid recep- 
tor superfamily DNA-binding domain consists of two DNA- 
binding zinc fingers separated by a less conserved linker 
region. In E75, as in nearly all other genes of this family, an 
intron is found in between the two fingers. In E75, this splice 
marks the beginning of the region held in common between 
the E75 A and B proteins. This results in the E75 A protein 
having two fingers, while the E75 B protein has unrelated 
B-specific sequences in place of the first finger. Other 
sequences within the B-specific amino-proximal region may 
contribute to the DNA-binding domain of the E74B protein. 

Alternatively, the B protein might bind DNA with only 
one finger, as GALA transcription factor of yeast appears to 
do. It is possible that these structural differences imply a 
functional difference in the DNA-binding properties of the 
E75 A and B proteins that might allow them to differentially 
regulate the transcription of the late genes that characterize 
the secondary response to ecdysone in different target tis- 
sues. 

In this respect, it should be emphasized that the putative 
hormone- or ligand-binding domain represented by the E 
region that is common to the E75A and E75B proteins. Thus, 
these proteins appear to be receptors for the same hormone 
that may act to regulate the transcription of different sets of 
genes. These proteins represent “orphan” receptors in that 
their hormone, or binding ligand, has not yet been identified. 
Because ecdysteroids are the only known steroid hormones 
in Drosophila, the most obvious candidate for an E75 ligand 
would be ecdysone itself. However, it is unlikely that this is 
the case since the putative hormone-binding domain of the 
E75 proteins does not exhibit the high sequence homology 
to that of the known Drosophila ecdysone receptor encoded 
by the EcR gene (see Experimental Example III and Table 
2) that would be expected if the E75 proteins were also 
ecdysone receptors. It, therefore, seems likely that the E75 


US 6,245,531 B1 


39 


proteins would bind either a terpenoid juvenile hormone or 
a novel Drosophila hormone. 

The second unusual feature of the E75 proteins is the 
presence of a large F region, encompassing nearly one half 
of the proteins. Many of the other receptors have very small 
F regions, and no function has yet been ascribed to this 
region. 

Methods 

Protein Sequence Analysis 

Sequence data were compiled using the Bionet system. 
Protein sequence comparison was performed using FASTP 
(Lipman, D. J., and W. R. Pearson, 1985. Rapid and sensitive 
protein similarity searches. Science 227:1435-1441) and 
Bionet IFIND programs. 

E. Expression Vectors for E75 Proteins 

In order to express the E75 proteins, portions of cDNAs 
and genomic clones were fused in order to generate cassettes 
containing the entire E75 A and E75 B protein coding 
regions. First, BamHI sites were introduced into genomic 
clones upstream of the initial AUGs of the large open 
reading frames. Then, E75 AO exon sequences were fused to 
sequences of a nearly full-length E75 A cDNA, and E75 B1 
exon sequences were fused to sequences of a nearly full- 
length E75 B cDNA. These cassettes were cloned into 
pGEM3 (Promega), and transcripts of the open reading 
frames were prepared using T7 polymerase. These were then 
translated in the presence of ??S-methionine, and shown to 
give rise to proteins of appropriate size. 

These cassettes have been placed into a variety of expres- 
sion vectors, including pUCHsneo/Act for expression in 
Drosophila cells, pSV2 for expression in mammalian cells, 
and pOTS for expression in bacterial cells. 

Methods 

BamHI sites were introduced directly upstream of the 
initial ATGs of the E75A and E75B coding sequence—at the 
Sspl site upstream of the E75A initial ATG, and at the SacII 
site upstream of the E75B initial ATG. cDNA and genomic 
sequences were joined at the EcoRV site in the AO exon to 
construct an E75A cassette, and at the Mlul in exon 3 to 
construct an E75B cassette. 


EXAMPLE II 


CLONING, STRUCTURE AND EXPRESSION 
OF THE EcR AND DHR3 GENES THAT 
ENCODE ADDITIONAL MEMBERS OF THE 
STEROID RECEPTOR SUPERFAMILY 


The following experiments were carried out after the 
primary structure of the E75 gene, and of the two members 
of the steroid receptor superfamily that it encodes, was 
determined (Experimental Example I). The purpose of these 
experiments was to clone and determine the primary struc- 
ture of other steroid receptor superfamily genes from 
Drosophila, and of the proteins they encode, with the aim of 
identifying the gene that encodes a Drosophila ecdysone 
receptor, given that the characteristics of the E75 gene 
indicated that it did not encode an ecdysone receptor. The 
first stage of the experimental plan was to use the conserved 
sequences in the E75A transcription unit that encode the 
putative DNA-binding domain of the E75A receptor protein 
as a probe to screen a Drosophila genomic library of cloned 
DNA segments to identify segments containing sequences 
encoding the putative DNA-binding domains of other 
Drosophila members of the steroid receptor superfamily. 
The second stage was to isolate cDNA clones from the 
identified genes, as well as additional genomic DNA clones, 
to obtain the nucleotide sequence of the complete coding 
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region (i.e., the open reading frame encoding the respective 
receptors) and the exon-intron organization of these genes. 

The experiments described below resulted in the cloning 
and structural characterization of two genes that satisfy the 
criteria for bona fide members of the steroid receptor super- 
family: encoding proteins that exhibit amino acid sequence 
homology to both the DNA-binding and the hormone- 
binding domains that are conserved among members of this 
superfamily. The two genes are called EcR and DHR3. The 
EcR gene was originally called DHR23, but was renamed 
EcR after it was shown to encode an ecdysone receptor (see 
Experimental Example IIT). The DHR3 designation stands 
for Drosophila Hormone Receptor 3. 

A. Identification and Chromosomal Mapping of EcR and 
DHR3 Genomic Clones 

Initially, Southern blots of total Drosophila genomic 
DNA, digested with one or another of several restriction 
endonucleases, were probed with a 530 bp fragment of the 
E75A cDNA containing the sequences encoding the putative 
DNA-binding domain of the E75A receptor protein (see 
Experimental Example I) at low and high stringency hybrid- 
ization conditions. 

To isolate the sequences responsible for these low strin- 
gency bands, this E75A probe was used to screen a Droso- 
phila genomic library under the same low stringency 
conditions, counterscreening duplicate filters with E75 
intron probes to eliminate phage-containing inserts from the 
E75 gene. Five genome equivalents were screened and 39 
non-E75 containing phage were isolated. The 25 most 
strongly hybridizing clones were divided into six classes on 
the basis of restriction mapping and cross hybridization, 
each class containing a set of between one and six indepen- 
dent overlapping genomic inserts. 

For each class, a restriction fragment containing the 
region of hybridization to the E75A probe was localized by 
Southern blotting. Hybridization of probes derived from 
these fragments to genomic Southern blots showed that each 
of the low stringency bands detectable by the E75A probe 
could be accounted for by one of the six isolated fragments. 

The nucleotide sequences of the six restriction fragments 
were determine to test whether they represent candidate 
receptor genes. In all cases, DNA sequence similarities with 
the E75A probe were observed that are sufficient to account 
for the hybridization of these fragments with the probe. 
When the DNA sequences were conceptually translated in 
all six reading frames, four of the fragments yielded no 
significant sequence similarity with E75A at the protein 
level. The remaining two clones, however, showed predicted 
amino acid sequences with strong similarity to the DNA 
binding domains of the E75A protein and other steroid 
superfamily receptors. 

These two clones represent the EcR and DHR3 genes, as 
will become apparent. Probes from these clones were used 
to map the position of these genes in the polytene chromo- 
somes by in situ hybridization. The EcR and DHR3 chro- 
mosomal loci were mapped to positions 42A and 46F, 
respectively, in the right arm of the second chromosome. 
B. Structure of the EcR and DHR3 Genes and their cDNAs 

The DHR3 and EcR genomic clones described above 
were used to screen a cDNA library prepared from third 
instar tissues treated with ecdysone and cycloheximide. This 
allowed the isolation of a large number of cDNA clones 
since both genes have a peak period of transcription in late 
third instar after the rise in ecdysone titer. For each gene, 20 
cloned cDNAs were purified and their lengths determined. 
Restriction maps for the 10 longest cDNAs from each gene 
were determined and found to be colinear. 
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For EcR, a 5534 bp cDNA sequence was obtained from 
two overlapping cDNA clones. It contains an 878 codon 
open reading frame (ORF) which yields a predicted amino 
acid sequence expected for a member of the steroid receptor 
superfamily (Table 2), as described in more detail below. 
The length of the largest DHR3 cDNA that was isolated 
(clone DHR3-9) is 4.2 kb. The nucleotide sequence of this 
cDNA was determined and found to contain a 487 codon 
AUG-initiated open reading frame (Table 3). As described 
below, the amino acid sequence of the DHR3 protein pre- 
dicted from this sequence demonstrates that this protein is 
also a bona fide member of the steroid receptor superfamily. 
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Table 2. The cDNA sequence of the EcR gene. Numerals 
at the left refer to the nucleotide sequences; those on the 
right to the amino acid sequence in the EcR protein. Nucle- 
otides 1—5194 are the sequence of EcR-17 cDNA, while 
nucleotides 5195—5534 derive from the EcR-9 cDNA. The 
underlined sequences in the 5' and 3' untranslated regions 
refer, respectively, to the ATG codons and the AATAAA 
consensus polyadenylation signals. Positions of the introns 
and the donor and acceptor splice sequences are indicated 
above the cDNA sequence in small type. The amino acid 
sequences homologous to the conceived DNA-binding (C 
region) and hormone-binding (E region) domains of the 
steroid receptor superfamily are underlined. 
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Table 3. The cDNA sequence of the DHR3 gene. The 
numbering and underlining of the nucleotide and amino acid 
sequences have the same meaning as in Table 2, and the 
intron positions and donor and acceptor splice sequences are 
similarly indicated. The sequence of the 5' proximal 2338 


50 
nucleotides of the DHR3-9 cDNA is shown. The sequence 
of the remainder of this 4.2 kb cDNA was determined for 
only one strand and is not shown. Four silent, third-position 
differences between the cDNA and genomic DNA sequences 
are indicated above the cDNA sequence. 
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The genomic structure of the EcR and DHR3 genes was 
investigated by isolating additional genomic DNA clones 
that form overlapping sets that contain all of the sequences 
found in the respective cDNA clones. The exons contained 
in these cDNAs were mapped within the genomic DNA by 
comparison of cDNA and genomic clones via Southern blot 
analysis, mapping of restriction cleavage sites, and finally, 
by determination of the nucleotide sequence of the genomic 
DNA in regions that contain the exon/intron boundaries. 
Table 2 and 3 show these boundaries and the sequence of the 
splice junctions for the EcR and DHR3 genes, respectively. 
All of these splice junctions conform to the splice donor and 
acceptor consensus sequences. 


For EcR, the cDNA sequence shown in Table 2 is split 
into six exons spread over 36 kb of genomic DNA, with the 
ORF beginning in the second exon and ending in the sixth. 
For DHR3, the cDNA sequence derives from nine exons 
spread over 18 kb, with the ORF beginning in the first exon 
and ending in the ninth. Because the 5' and 3' ends of the 
respective mRNAs were not mapped, it should be empha- 
sized that these genes may have additional noncoding exons 
at their 5' or 3' ends. 


The EcR and DHR3 gene structures differ significantly 
from those of all previously examined steroid receptor 
superfamily genes comparison with the genes for 11 other 
receptor homologues for which at least partial structural 
information is available reveals that the positions of certain 
exon boundaries have been conserved in evolution. This 
conservation is most striking in the portion of the genes 
encoding DNA-binding domains. In the nine other cases 
where the structure of this region has been examined, the 
two halves of the DNA-binding domain are always encoded 
by separate exons. If we exclude the Drosophila genes 
knirps, knirps-related, and egon (which are not bona fide 
receptor homologues since they lack the hormone-binding 
domain sequence similarity), these are always small exons, 
the second one invariably ending in the fourth codon past the 
conserved Met codon at the end of the C region. Thus, these 
exons each encode one of the two predicted Zn fingers of 
the, DNA-binding domain. In contrast, both Zn fingers of the 
putative DNA-binding domain of the EcR and DHR3 recep- 
tors are encoded by a single exon. It is possible that our 
screen specifically selected for genes lacking the above 
intron. The screen selected genomic clones that hybridize to 
an E75A cDNA probe that, of course, lacks this intron. 
Genomic sequences containing a contiguous sequence 
encoding the DNA-binding domain would be expected to 
hybridize to this probe better than clones from genes con- 
taining the intron. This would explain the successful isola- 
tion of the EcR and DHR3 genes, and the failure to isolate 
the genes of other Drosophila members of the steroid 
receptor superfamily. 


Methods 
Isolation of cDNA and Additional Genomic Clones 


Subclones of the originally isolated DHR3 and EcR 
genomic clones were used to screen a cDNA library pre- 
pared from third instar tissues treated with ecdysone and 
cycloheximide. This library was chosen because both genes 
are relatively highly expressed at the end of third instar, and 
because of the high quality of the library. Of the 270,000 
primary plaques screened, 20 positives for DHR3 and 220 
for EcR were detected. Twenty cDNAs for each gene were 
purified, of which the ten largest for each were restriction 
mapped and found to be colinear. cDNA DHR3-9, which 
extends farther both 5' and 3' than our other DHR3 cDNAs, 
was chosen for sequencing. For EcR, the longest cDNA, 
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EcR-17, extended the farthest 5' and was sequenced in its 
entirety. An additional cDNA clone, EcR-9, was found to 
extend 300 bp farther 3' than EcR-17, and this 3' extension 
was also sequenced. Additional genomic DNA clones cov- 
ering the EcR and DHR3 genes were obtained by screening 
the Drosophila Canton S genomic library referred to in part 
A above either with probes from the respective cDNA 
clones, or for overlapping clones by the chromosomal walk 
method described in Experimental Example I. 


DNA Sequence Analysis 


cDNAs were subcloned into BlueScript vectors (Strat- 
agene), and clones for sequencing were generated by exo- 
nuclease III digestion (Henikoff, S., 1984. Unidirectional 
digestion with exonuclease III creates targeted breakpoints 
for DNA sequencing. Gene 28:351—359). 


Double-stranded plasmids were denatured (Gatermann, 
K. B., G. H. Rosenberg, and N. F. Kaufer, 1988. Double- 
stranded sequencing, using mini-prep plasmids, in 11 hours. 
BioTechniques 6:951-952) and sequenced by the dideoxy 
chain terminating method (Sanger, F., S. Nicklen, and A. R. 
Coulson, 1977. DNA sequencing with chain-terminating 
inhibitors. Proc. Natl. Acad. Sci. USA 74:5463—5467), using 
the enzyme Sequenase (U.S. Biochemical). cDNA EcR-17 
was completely sequenced on both strands, as was the EcR-9 
3' extension. cDNA DHR3-9 was sequenced on both strands 
for the 5' most 2338 bp, which contains the entire ORF, and 
the remainder of the long 3' untranslated region was 
sequenced on one strand. 


The exon/intron boundaries in genomic DNA clones were 
first mapped at low resolution by Southern blot analysis of 
their restriction fragments probed with labeled cDNAs. 
Genomic DNA surrounding each exon/intron boundary was 
subcloned and the nucleotide sequence of these subclones 
determined as above. 


Genomic exons were either sequenced entirely, or for the 
longer exons, were digested and electrophoresed in parallel 
with cDNA clones to confirm the colinearity of the genomic 
and cDNA clones. Shorter exons were completely 
sequenced from genomic clones. Longer exons had their 
boundaries sequenced from genomic clones, and were con- 
firmed to be colinear with the cDNA clones by parallel 
digestion and electrophoresis of the cDNA and genomic 
clones. 

C. The Predicted Amino Acid Sequence of the EcR and 
DHR3 Proteins and their Implications 

Comparison of the predicted EcR and DHR3 protein 
sequences to the sequence database and to individual mem- 
bers of the steroid receptor superfamily shows that these 
proteins share the two conserved domains characteristic of 
this superfamily (Evans, R. M., 1988. The steroid and 
thyroid hormone receptor superfamily. Science 
240:889—895; Green, S., and P. Chambon, 1988. Nuclear 
receptors enhance our understanding of transcription regu- 
lation. Trends in Genetics 4:309-314). We refer to the 
domains as the C and E regions, for the more amino-terminal 
and more carboxy-terminal homologies, respectively, 
according to the nomenclature of Krust et al. (Krust, A., S. 
Green, P. Argos, V. Kumar, P. Walter, J. M. Bornert, and P. 
Chambon, 1986. The chicken oestrogen receptor sequence; 
homology with v-erbA and the human oestrogen and glu- 
cocorticoid receptors. EMBO J. 5:891-897). These domains 
are underlined in Tables 2 and 3, and Table 4A-C presents 
a comparison of these domains from EcR and DHR3 with 
those from representative members of the superfamily. 

Table 4. Sequence comparison of the conserved C and E 
regions in DHR3, EcR, and some representative nuclear 
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receptor homologues. (A) C-region alignment. Numbers at 
the left indicate the amino acid positions within the indi- 
vidual receptors; dashes indicate gaps introduced to obtain 
maximal alignment. Dots indicate three positions important 
in determining the DNA binding specificity of this domain. 
(B) E-region alignment. Bars indicate the three most highly 
conserved stretches within this domain. (C) Computed per- 
cent identifies among the C-region sequences (lower left) 
and among the E-region sequences (upper right). The kni 
sequence shows no significant E-region homology and is, 
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therefore, not included in this comparison. Sequences shown 
are from: E75A, Drosophila ecdysone-inducible gene at 
75B; kni, Drosophila segmentation gene knirps; hRARa, 
human retinoic acid receptor alpha; htRQ, human thyroid 
receptor beta; hVDR, human vitamin D receptor; cCOUP-TF, 
chicken ovalbumin upstream promoter transcription factor; 
hERR1 and hERR2, human estrogen-related receptors 1 and 
2; hER, human estrogen receptor; hGR, human glucocorti- 
coid receptor; hMR, human mineralocorticoid receptor; 
hPR, human progesterone receptor. 
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The C region is a 66—68 amino acid domain that has been 
shown to function as a Zn finger DNA binding domain in 
vertebrate receptors. This domain has also been implicated 
in receptor dimerization (Kumar, V., and P. Chambon, 1988. 
The estrogen receptor binds tightly to its responsive element 
as a ligand-induced homodimer. Cell 55:145-156). As 
shown in Table 4A, all 19 C-region residues that are abso- 
lutely conserved in the other receptor homologues are also 
conserved in DHR3 and ECR, including the nine invariant 
Cys residues, eight of which coordinate two zinc ions 
(Freedman, L. P., B. F. Luisi, Z. R. Korszun, R. Basavappa, 
P. B. Sigler, and K. R. Yamamoto, 1988. The function and 
structure of the metal coordination sites within the gluco- 
corticoid receptor DNA binding domain. Nature 
334:543-546). As seen in Table 4C, the Drosophila C-region 
sequences (including those of E75A) are not more closely 
related to each other than they are to those from the 
vertebrate receptor homologues. The C region of DHR3 is 
most similar to that of the human retinoic acid receptor a 
(hRARo), and the C region of EcR is most similar to that of 
the human thyroid receptor B (hTRf). Studies on the human 
glucocorticoid receptor (hGR) and human estrogen receptor 
(hER) have identified three C-region residues (indicated by 
dots in Table 4A) that are critical for determining the 
differential DNA binding specificity of these receptors 
(Mader, S., V. Kumar, H. de Verneuil, and P. Chambon, 
1989. Three amino acids of the oestrogen receptor are 
essential to its ability to distinguish an oestrogen from a 
glucocorticoid-responsive element. Nature 338:271-274; 
Umesono, K., and R. M. Evans, 1989. Determinants of 
target gene specificity for steroid/thyroid hormone receptors. 
Cell 57:1139—46). The three Drosophila proteins DHR3, 
EcR, and E75A, as well as the vertebrate receptors hRRARa, 
hTRf, and the human vitamin D receptor (hVDR), all have 
identical amino acids at these three positions; thus, these 
proteins may all have similar DNA binding specificities, as 
has already been shown for hRARa and hTR£ (Umesono, 
K., V. Giguere, C. K. Glass, M. G. Rosenfeld, and R. M. 
Evans, 1988. Retinoic acid and thyroid hormone induce 
gene expression through a common responsive element. 
Nature 336:262—265). 


The E-region is an ~225 amino acid domain that functions 
as a hormone-binding domain in vertebrate receptors. This 
domain has also been implicated in hormone dependent 
receptor dimerization (Kumar, V. and P. Chambon, 1988. 
The estrogen receptor binds tightly to its responsive element 
as a ligand-induced homodimer. Cell 55:145-156; Guio- 
chon, M. A., H. Loosfelt, P. Lescop, S. Sar, M. Atger, A. M. 
Perrot, and E. Milgrom, 1989. Mechanisms of nuclear 
localization of the progesterone receptor: evidence for inter- 
action between monomers. Cell 57:1147-1154), hormone 
dependent nuclear localization of the glucocorticoid recep- 
tor (Picard, D., and K. R. Yamamoto, 1987. Two signals 
mediate hormone-dependent nuclear localization of the glu- 
cocorticoid receptor. EMBO J. 6:3333-3340), and binding 
of the glucocorticoid receptor to the 90 kDa heat shock 
protein (Pratt, W. B., D. J. Jolly, D. V. Pratt, W. M. 
Hollenberg, V. Giguere, F. M. Cadepond, G. G. Schweizer, 
M. G. Catelli, R. M. Evans, and E. E. Baulieu, 1988. A 
region in the steroid binding domain determines formation 
of the non-DNA-binding, 9 S glucocorticoid receptor com- 
plex. J. Biol. Chem. 263:267—273). Table 4B shows an 
alignment of the E regions of the DHR3 and EcR proteins 
with those of other receptor homologues. The three rela- 
tively highly conserved stretches within this region noted in 
Experimental Example I are overlined; each contains a 
cluster of residues conserved in all or most of the receptor 
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sequences. DHR3 and EcR show strong similarity to each 
other and to the other proteins in these stretches, and a lower 
similarity outside of them. The presence of this E-region 
homology establishes these proteins as bona fide members 
of the nuclear receptor family, in contrast to the Drosophila 
knirps (Nauber, U., M. J. Pankratz, A. Kienlin, E. Seifert, U. 
Klemm, and H. Jackle, 1988. Abdominal segmentation of 
the Drosophila embryo requires a hormone receptor-like 
protein encoded by the gap gene knirps. Nature 
336:489-492), knirps-related (Oro, A. E., E. S. Ong, J. S. 
Margolis, J. W. Posakony, M. McKeown, and R. M. Evans, 
1988. The Drosophila gene knirps-related is a member of the 
steroid-receptor gene superfamily. Nature 336:493—496), 
and egon (Rothe, M., U. Nauber, and H. Jackle, 1989. Three 
hormone receptor-like Drosophila genes encode an identical 
DNA-binding finger. EMBO J. 8:3087-3094) proteins, 
which show C-region homology but no E-region homology. 
The E region in DHR3 is most similar to that of E75A, and 
the E region of EcR is most similar to that of hTRp, although 
the level of these similarities is lower than those found 
among E regions of many other receptors (Table 4C). Thus, 
DHR3 and EcR are not especially close homologues of any 
previously cloned receptors. Comparison of E-region 
sequences allows division of the nuclear receptors into 
subfamilies (Petkovich, M., N. J. Brand, A. Krust, and P. 
Chambon, 1987. A human retinoic acid receptor which 
belongs to the family of nuclear receptors. Nature 
330:444-450), the members of any one subfamily being 
more related to each other than to those in other subfamilies. 
The DHR3 and EcR receptors fall into a subfamily with the 
E75A, E75B, hRARa, hTRB and hVDR receptors. 

D. In Situ Labeling of the EcR and DHR3 Proteins with 
Antibodies Induced by Proteins Produced in E. coli 

To determine the intracellular and tissue distribution of 
the EcR and DHR3 proteins in Drosophila, affinity-purified 
polyclonal antibodies directed against those proteins were 
produced in the following manner. The region of about 120 
amino acid residues that is located between the conserved 
DNA-binding and hormone-binding domains of these pro- 
teins was used as the immunogen to produce antibodies 
against each protein. Thus, the coding sequences for amino 
acids 335-447 of the EcR protein and for amino acids 
164-289 of the DHR3 protein (see Tables 2 and 3, 
respectively) were cloned into the appropriate pATH 
(Dieckmann, C., and A. Tzagaloff, 1985. J. Biol. Chem. 
260:1513-1520) or pUR expression vectors, so as to fuse 
these coding sequences to those encoding E. coli 
B-galactosidase (Bgal) or to E. coli tryptophan E protein 
(trpE), respectively. 

The gal fusion proteins were produced in E. coli by the 
addition of the IPTG inducer to exponential cultures, while 
the production of trpE fusion proteins were induced by 
dilution into tryptophan-free media and subsequent addition 
of indoleacetic acid. For EcR, the trpE fusion protein was 
used as an immunogen and the f)gal fusion protein was used 
on immunoblots to test sera for immunoreactivity to the EcR 
portion of the fusions. For DHR3, the gal fusion protein 
was injected, and sera were checked against the trpE fusion 
protein. 

For immunization the appropriate fusion protein was 
prepared by electrophoresis in SDS-PAGE gels and visual- 
ized by staining in ice-cold 0.25 M KC,, after which the 
fusion protein band was cut out. Approximately 100 ug of 
fusion protein in 0.25 ml of gel slice was crushed by passing 
through successively smaller hypodermic needles, and 
mixed with 0.25 ml of a sterile saline solution and 0.5 ml of 
Freund's complete adjuvant. For each immunogen, two New 
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Zealand White rabbits were injected at multiple intramus- 
cular sites, and after one month, boosted at two-week 
intervals, omitting the Freund’s adjuvant. While the gal 
fusion proteins were subject to the above gel electrophoresis 
without prior purification, the trpE fusion proteins were first 
purified by the following method which takes advantage of 
their insolubility in vivo. 

E. coli from a 2-liter culture of induced cells were washed, 
and the cell pellet was subjected to several freeze/thaw 
cycles. The cells were resuspended in 18 ml of 50 mM Tris 
HC1, pH 7.5, 0.5 mM EDTA, and 1.8 ml of 10 mg/ml 
lysozyme was added. After 15 minutes on ice, the cells were 
lysed by passing three times through a french pressure cell 
at 10,000 psi. The insoluble fraction was collected by 
centrifugation at 27,000xg for 15 minutes, and washed by 
resuspension, using a Dounce homogenizer, in ice-cold 50 
mM Tris HCl, 0.5 mM EDTA, 0.3 M NaCl, followed by 
centrifugation as above. The washing step was repeated, and 
the final pellet dissolved in 10 ml of 4M urea, 2% (w/v) SDS, 
50 mM Tris HCl, pH 7.5, 1 mM EDTA, 5% (v/v) 
2-mercaptoethanol. Material remaining insoluble was cen- 
trifuged out and discarded. 

The antisera were affinity purified in a two-step procedure 
by successively passing the antibodies through “nonspe- 
cific" and "specific" affinity columns. In the case of anti- 
bodies raised against the trpE fusion proteins, the nonspe- 
cific column consisted of resin coupled to the insoluble 
protein derived from E. coli expressing unmodified trpE 
protein, and was used to remove antibodies directed against 
trpE epitopes, as well as against insoluble E. coli protein 
impurities. The specific column consisted of resin coupled to 
the EcR-trpE fusion protein (purified as described above) 
and was used to absorb the desired antibodies directed 
against the EcR epitopes, antibodies that were subsequently 
released from the column. In the case of antibodies raised 
against the Bgal fusion proteins, the same general procedure 
was used, except that the resin in the nonspecific column 
was coupled to f-galactosidase, while that in the specific 
column was coupled to the DHR3-pgal fusion protein. 
Western blot analysis of the appropriate E. coli extracts 
demonstrated that these affinity-purified antibodies exhib- 
ited the desired specificity. 

The intracellular distribution of the EcR protein in late 
third instar salivary glands was examined by in situ labeling 
of this protein with the anti-EcR antibody. The EcR protein 
was thereby shown to be highly localized in the nuclei of 
these glands. Indeed, when the polytene chromosomes in 
these nuclei were examined by the antibody-labeling method 
of Zink and Paro (Zinc, B., and R. Paro, 1989. Nature 
337:468-471), specific loci within these chromosomes 
exhibited strong binding of the EcR protein. In particular, 
the EcR protein was bound to the early puff loci, including 
those occupied by the E75 and E74 genes. This is the result 
expected if the ecdysone receptor encoded by the EcR gene 
is that which induces the transcription of the early genes, as 
anticipated by the Ashburner model. Another prediction of 
the Ashburner model is that the ecdysone-receptor complex 
initially represses the genes responsible for the later puff, so 
that the transcription of the late genes induced by the early 
gene proteins is delayed until these proteins accumulate 
sufficiently to overcome this initial repression. If the EcR 
receptor is involved in this postulated initial repression, then 
one would expect the EcR protein to bind to the late puff loci 
in the salivary glands. This expectation was met by the 
observation that EcR protein also binds to the late puff loci 
in the polytene chromosomes. 

Additional in situ antibody labeling experiments demon- 
strated that the EcR protein is present in the nuclei of all 
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ecdysone target tissues examined in late third instar larvae. 
It is also present in most, if not all, cells during embryo- 
genesis and other stages of Drosophila development that 
have been examined. In this respect, the EcR protein was not 
detected by anti-EcR antibody labeling of embryos in which 
the EcR gene was eliminated by a chromosomal deletion, 
further demonstrating the specificity of this antibody. 

In contrast to the widespread distribution of the EcR 
protein, anti-DHR3 antibody labeling of embryos demon- 
strated that the distribution of the DHR3 protein is highly 
restricted during this stage of development. During the brief 
embryonic period of expression, the protein is restricted to 
the peripheral nervous system, and to cells surrounding the 
spiracles at the posterior end of the embryo. 

Finally, it should be noted that affinity-purified antibodies 
against the E75A protein have also been prepared by the 
same technique described above for anti-EcR and anti- 
DHR3 antibodies. In situ antibody labeling of the E75A 
protein in larval salivary glands has also demonstrated that 
this protein is localized in the nucleus and is bound to 
specific loci in the polytene chromosomes. 


EXAMPLE III 


THE ECDYSTEROID-BINDING, DNA-BINDING 
AND GENETIC REGULATORY PROPERTIES 
OF THE EcR PROTEIN DEMONSTRATE THAT 
IT IS AN ECDYSONE RECEPTOR 


The following experiments demonstrate that the protein 
encoded by the EcR gene is an ecdysone receptor by the 
following three criteria. (1) The EcR protein binds ecdys- 
teroids and accounts for a large proportion, if not all, of the 
ecdysteroid-binding activity present in Drosophila embryos 
and in a variety of cultured Drosophila cells. (2) The EcR 
protein binds with high specificity to a DNA sequence that 
functions as an ecdysone response element (EcRE), i.e., an 
enhancer that confers ecdysone inducibility to a promoter. 
(3) Cells that do not respond to ecdysone because they lack 
functional ecdysone receptors are transformed to the 
ecdysone-responsive state by transfection, with an EcR 
expression plasmid. 

A. The EcR Protein Binds Ecdysteroids 

The EcR expression plasmid, pMTEcR, shown in FIG. 1 
contains the open reading frame encoding the EcR protein 
(EcR ORF; see Experimental Example II) fused to the 
Drosophila metallothionine promoter (P,;;) at its 5' end, and 
the polyadenylation-cleavage sequences of the Drosophila 
Actin 5C gene at its 3' end. Because transcription of the EcR 
ORF is under control of this metallothionine, that transcrip- 
tion is induced by Cu?* ion to yield an mRNA that, in turn, 
yields the EcR protein. A cell line, MtEcRHy, that overpro- 
duces this protein upon CU?* induction, as determined by 
Western blot analysis using the affinity-purified anti-EcR 
antibody (see Experimental Example II), was constructed by 
the stable integration of the pMTEcR plasmid DNA into the 
genome of Drosophila Sch-2 cell line. A control cell line, 
MtHy, was similarly constructed by the integration of the 
expression vector DNA lacking the EcR ORF. 

Whole cell extracts were prepared from both the MtEc- 
RHy and MtHy cell lines after Cu?* induction, and were 
assayed for ecdysteroid-binding activity using the high 
affinity ecdysone analogue [I] iodoponasterone A. The 
MtEcRHy extract contained sevenfold more saturable 
ecdysteroid-binding activity than the MtHy control extract. 

To see if the induced ecdysteroid-binding activity was due 
to the EcR polypeptide itself, the EcR protein was depleted 
from the MtEcRHy extract by immunoprecipitation using an 
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affinity-purified anti-EcR polyclonal antibody, or, as a 
control, the extract was mock-depleted with preimmune 
serum. The treated extracts were then assayed for 
ecdysteroid-binding activity. Comparison of the immuno- 
depleted extract with the mock-depleted extract showed that 
most of the binding activity was removed by the anti-EcR 
antibody treatment, indicating that the induced ecdysteroid- 
binding activity results from the EcR protein. 

The endogenous ecdysteroid-binding activity in the con- 
trol cell line, MtHy, was unchanged by Cu?* exposure, and 
was approximately the same as that in the Sch-2 cell from 
which it derives. The question arises as to whether the 
endogenous activity in these and other Drosophila cell lines, 
as well as in embryonic extracts, results from the expression 
of the EcR gene in their respective genomes. To answer this 
question, extracts from embryos and several cell lines were 
immuno-depleted and mock-depleted, as described above, 
and assayed for ecdysteroid-binding activity. Again, com- 
parison of these treated extracts showed that the large 
majority of the endogenous binding activity was removed in 
each case by treatment with the anti-EcR antibody. Thus, it 
appears that most, if not all, of the endogenous binding 
activity in embryos and cell lines results form the resident 
EcR gene. 

Methods 

Extracts 

Tissue culture cell extracts for hormone and DNA- 
binding experiments were prepared as follows. Cells were 
grown in spinner flasks to a density of 5-7x10° cells/ml, and 
were washed once in EcR buffer (25 mM Hepes, pH 7.0, 40 
mM KCl, 10% (v/v) glycerol, 1 mM EDTA, 1 mM 
dithiothreitol, and the following cocktail of protease inhibi- 
tors: 10 mM Na,S,0;, 500 uM PMSF, 1 uM leupeptin, 1 uM 
pepstatin). All further manipulations were at 4? C. Cells 
were resuspended in EcR buffer at 2% of the original culture 
volume, divided into 3 ml aliquots, and sonicated using 30% 
second pulses with a probe sonicator (Bronson Sonifier 
450), resulting in disruptions of -95% of the cells. After 
centrifugation at 100,000xg for 1 hour, 100 wl aliquots of 
supernatant were frozen in liquid nitrogen, and stored at 
-80° C. Protein concentration was determined using bone 
serum albumin as the standard, and was typically 6-11 
mg/ml. Embryo extracts were prepared by a similar proto- 
col: 3-6 hour Canton S embryos were dechorionated in 55% 
commercial bleach for 2 minutes, washed extensively in 
0.7% NaCl, and resuspended using 2 grams of embryos per 
ml of EcR buffer. Embryos were broken with 20 strokes in 
a Dounce homogenizer using a B pestle, and lysis was 
completed with the probe sonicator using the same settings 
as used for the tissue culture cells. The extract was adjusted 
to 400 mM KCI, centrifuged 1 hour at 100,000xg, and 
aliquots of supernatant were frozen. This extract contained 
13.4 mg/ml protein. Before use in hormone binding, it was 
diluted tenfold in EcR buffer lacking KCl to bring the final 
KCI concentration to 40 mM. 

Hormone-binding Assays 

For hormone-binding experiments, extracts were first 
diluted to the following concentrations in EcR buffer: 0.9 
mg/ml for MtHy and MtEcRHy extracts, 3 mg/ml for S2 and 
SRS 1.5 extracts, 4 mg/ml for the Kc cell extracts, and 1.3 
mg/ml for the embryo extract. All manipulations were done 
on duplicate samples in order to quantify variability in the 
results. For immunoprecipitation experiments, extracts were 
immunodepleted, mock-depleted, or left untreated. For 
depletions, 300 ul of diluted extract was incubated for 30 
minutes at 25? C. with 3.5 ul affinity-purified anti-EcR 
antibody, or with 3.5 ul preimmune serum for the mock- 
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depletion control. Then 38 ul 10% Staphylococcus aureus 
(Pansorbin, Calbiochem) in EcR buffer was added, and 
incubation was continued for 15 minutes at 25? C. After 
centrifugation for 3 minutes in a microcentrifuge, the super- 
natant (depleted extract) was recovered. The immunopre- 
cipitation was repeated, except in the case of the embryo 
extract which was subjected to only one round of precipi- 
tation. The *untreated" extract aliquots were left at 4? C. for 
the duration of the depletion procedure, and were diluted 
with EcR buffer to match the final concentration of the 
depleted aliquots. 

The [*7°I] iodoponasterone was supplied by P. Cherbas, 
and a modification of his hormone-binding assay was used 
(Cherbas, P. 1988. Proc. Nat’l Acad. Sci. U.S.A. 
85:2096—2100). Assay tubes contained 140 ul extract, 14 ul 
[#1] iodoponasterone, and either 14 ul EcR buffer or 14 ul 
unlabelled 20-OH ecdysone in EcR buffer as a competitor. 
[#1] iodoponasterone was 2177 Ci/mM and was used at a 
final concentration of 5x1071? M in the assay; 20-OH 
ecdysone was 2x10^? M final concentration in the assay. 
After incubation for 1 hour at 25? C., each reaction was 
spotted on a dry Whatman GF/C filter (2.4 cm), and after 30 
seconds the filter was washed by using a vacuum to draw 10 
ml EcR buffer through the filter over a period of 1 minute. 
Filters were placed in 800 xl 4% SDS, and radioactivity was 
measured in a y counter. The hormone-binding activities 
shown are saturable binding activities, calculated as the total 
binding activity, as measured in assays with no added 
competitor, minus the unsaturable binding activity, mea- 
sured in the assays with excess unlabelled ecdysone added. 
In the most active extracts, the unsaturable activity 
(representing the large number of low affinity binding sites 
in the extract) was less than 10% of the total activity. 

B. Genetic Regulatory Activity of the EcR Protein in vivo 

An ecdysone-inducible reporter plasmid, pEcRE/Adh/ 
Bgal (FIG. 2), was constructed to test the regulatory func- 
tions of the EcR protein in vivo. The reporter gene in this 
plasmid consists of the sequence that encodes the E. coli 
B-galactosidase (Bgal ORF) linked through the 5' leader 
sequence of the Drosophila Ultrabithorax gene (UBX leader 
and AUG) to an ecdysone-inducible promoter. This pro- 
moter was created by fusing a truncated version of the 
proximal promoter for the Drosophila Adh gene 
(PpAan-34453» the numbers indicating that it consists of the 
sequence from base pair positions —34 to +53, which just 
includes the TATA box) to seven repeats of a 34 bp synthetic 
oligonucleotide (7 EcRE OLIGOS) which contains the 
ecdysone response element (EcRE) from the ecdysone- 
inducible heat shock gene hsp 27 (Riddihough and Pelham, 
1987. EMBO J. 6:3729-3734). The seven EcREs should 
confer ecdysone-inducibility to the truncated promoter, pro- 
vided that the cells transfected with this reporter plasmid 
contain the appropriate ecdysone receptor. 

This ecdysone-inducible reporter plasmid was con- 
structed by insertion of the 7 EcRE OLIGOS into plasmid 
pAdh/pgal, which is identical to pEcRE/Adh/figal except 
that it lacks the array of ecdysone response elements. The 
pAdh/pgal plasmid should therefore not be ecdysone induc- 
ible and can serve as a control. To test these expectations, 
Sch-2 cultured cells (which were shown above to contain 
endogenous ecdysone-binding activity) were transfected 
with each plasmid and examined for f-galactosidase activity 
in the presence and absence of ecdysone. The ecdysone- 
induced f-galactosidase activity in the pEcRE/Adh/Bgal 
transfected cells was 2000-fold greater than when such cells 
were not exposed to ecdysone, whereas ecdysone had little 
effect on the pAdh/pgal transfected cells. These results 
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indicate that the EcREs confer ecdysone-inducibility on the 
Pradh-34453 promoter, as expected, and that the Sch-2 cells 
contain functional ecdysone receptors. 

To test the function of the EcR receptor in such a system, 
host cells lacking functional ecdysone receptors are 
required. *Ecdysone-resistant" cells lacking ecdysone- 
binding activity, and hence, presumably, functional recep- 
tors can be produced by continuously exposing ecdysone- 
responsive cells to ecdysone during a period of several 
weeks. This ecdysone-resistant state is then maintained in 
ecdysone-free media for several months. An ecdysone- 
resistant cell line, SRS 1.5, was therefore generated by 
growing Sch-2 cells in 5x107% M ecdysone. The SRS 1.5 
cells lack significant ecdysone-binding activity. 

When these cells were transfected with the pEcRE/Adh/ 
fgal plasmid and subsequently exposed to ecdysone, very 
little ecdysone-induced f-galactosidase activity was 
observed, indicating that the cells have only trace amounts, 
if any, of functional receptors. To test whether the expression 
of the EcR gene can “rescue” this deficiency, the SRS 1.5 
cells were cotransfected with two plasmids: the ecdysone- 
inducible reporter plasmid, PEcRE/Adh/pgal, and a consti- 
tutive expression plasmid for the EcR gene, pActEcR, in 
which transcription of the EcR ORF is controlled by the 
Drosophila Actin 5C promoter, P, s< (FIG. 3). Cotransfec- 
tion with these two plasmids, followed by exposure to 
ecdysone, resulted in a dramatic induction of 
p-galactosidase activity. Thus, introduction of this EcR 
expression plasmid into the SRS 1.5 cells regenerated the 
ecdysone-inducibility they had lost. 

Methods 

Construction of the pAdh/Bgal, pEcRE/Adh/Bgal and 
pActEcR Plasmids 

Plasmid pAdh/fjgal was constructed in two steps. The 
Bglll-Scal fragment of pDA5'-34, containing nucleotides 
-34 to +53 of the Drosophila Adh distal promoter, was 
cloned into pUC18 cut with Scal and BamHI. The resulting 
plasmid was cut with EcoR1, and the EcoR1 fragment of 
cPfbxd6.2 (containing the Ubx untranslated leader and 
AUG, the Bgal open reading frame, and the SV40 splice and 
poly A signals) inserted. 

To construct pEcRE/Adh/pgal from pAdh/pgal, two 
34-residue oligonucleotides were synthesized: 

5'TCGAGAGACAAGGGTTCAATGCACT- 

TGTCCAATG3' 
3'CTCTGTTCCCAAGTTACGTGAACAGGT- 
TACAGCTS' 
These will anneal to form 30 bp duplexes with Sall com- 
patible four nucleotide overhangs at their 5' ends, as shown. 
Further annealing via the 5' overhangs allows formation of 
tandem arrays that can be inserted into pAdh/fgal at its Sall 
site just upstream from the TATA box of the truncated Adh 
promoter. When these oligonucleotides were kinased, 
annealed, ligated into Sall-cut pAdh/fgal and cloned, 
pEcRE/Adh/fgal was obtained. Restriction mapping 
showed that it contained a tandem array of seven 34 bp 
repeats, each of which contains the 23 bp ecdysone response 
element (EcRE) present in the hsp 27 gene, the remaining 11 
bp representing flanking hsp 27 sequences and the 5' over- 
hangs. 

The constitutive EcR expression plasmid, pActEcR, was 
formed by inserting the Fsp1-Hpal fragment of an EcR 
cDNA containing bp 851—4123 that contains the ORF 
encoding the EcR protein (Table 2), into the EcoRV site of 
the ActSV40BS plasmid. This expression vector was con- 
structed in two steps by inserting the Xbal-EcoR1 fragment 
of cosPneof-gal, containing the SV40 splice and poly A 
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signals, into BlueScript+KS (Stratagene) cut with SacI and 
Xbal, blunting the EcoR1 and Sacll ends. The resulting 
plasmid was digested with BamH1 and Apal, and the 
BamH1-EcoR1 fragment of pPAc was inserted, with the 
Apal and EcoR1 ends being blunted. 

Transfection and Generation of the Cell Line SRS 1.5 

The cell line SRS 1.5 was obtained by growing Schneider 
line 2 (Sch-2) cells in the presence of 2x10? M 20-OH 
ecdysone (Sigma). This treatment initially halts growth of 
Sch-2 cells, but after several weeks the adapted cells grow 
well. SRS 1.5 cells were washed in hormone-free medium 
and passed several times in hormone-free medium prior to 
their use in transfection experiments. Cells were transfected 
by the calcium phosphate technique. Cells were transfected 
with 10 ug of each plasmid used; when only a single plasmid 
was being transfected, 10 ug of pUC18 DNA was added as 
a carrier. In general, all transfections were carried out in 
duplicate. Twenty-four hours after transfection, cells that 
were to undergo hormone treatment were split into two 
dishes, one of which was treated with 2x10? M 20-OH 
ecdysone. 

B-galactosidase Assays 

Forty-eight hours after transfection, 2 ml of cells were 
washed once in PBS (137 mM NaCl, 27 mM KCl, 65 mM 
Na4HPO,, 15 mM KH,;PO,, pH 6.8), and were resuspended 
in 50 ul of 0.25 M sucrose, 10 mM Tris, pH 7.4, 10 mM 
EDTA, and repeatedly frozen in liquid nitrogen and thawed 
in a 37? C. water bath for a total of 3 freeze/thaw cycles. Cell 
debris was removed by a 10-minute centrifugation in a 
microcentrifuge at 4? C. The concentration of protein in the 
supernatant (cell extract) was determined by the Bradford 
method, with bovine serum albumin as a standard, and was 
typically 1.5-2.5 mg/ml. Extracts were assayed immediately 
or frozen and assayed up to two weeks later with no loss in 
activity. To 10 wl of extract, or an appropriate dilution, 500 
ul of assay buffer was added (0.6 mM 4-methylumbelliferyl- 
B-D-galactoside, 60 mM Na,HPO,, 40 mM NaH,PO,, 10 
mM KCl, 1.0 mM MgSO,, pH 7.0). After a 30-minute 
incubation at 37? C., reactions were stopped with 500 4d of 
300 mM glycine, 15 mM EDTA, pH 11.2. The fluorescent 
reaction product was quantified on a Perkin-Elmer LS-5B 
luminescence spectrometer, with À,,=365 nm and À,,,=450 
nm. Bal activities are given as fluorescence units per ug 
protein assayed. 

C. Specific Binding of the EcR Protein to Ecdysone 
Response Elements 

The simplest explanation of the results described in the 
preceding section is that the EcR protein generated by the 
EcR expression plasmid binds to the EcRE of the reporter 
plasmid and, in combination with ecdysone, activates the 
minimal Adh promoter in that plasmid. The following 
experiment was designed to test whether the EcR protein 
exhibits specific binding to this EcRE in vitro. 

Two plasmids were used: pUC18, which serves as the 
control, and pUC18-EcRE, which was generated by substi- 
tuting the HindII-Xbal fragment from pEcRE/Adh/ gal that 
contains the seven repeats of the 34 bp EcRE 
oligonucleotide, for the HindII-Xbal fragment of pUC18. 
Because the only difference between these two fragments is 
the seven oligonucleotide repeats, this is also the only 
difference between the two plasmids. 

The two plasmids were digested with ApaLI and Hind III, 
#p end labeled and mixed with an extract from MtEcRHy 
cells in which the EcR protein was overexpressed by Cu** 
induction (see section A, above). After a 15-minute incuba- 
tion at 25? C. to allow EcR-DNA binding to occur, affinity- 
purified anti-EcR antibody was added. The 25? C. incuba- 
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tion was continued for an additional 40 minutes, at which 
time anti-rabbit Ig-coated magnetic beads (Dupont 
Magnasort-R) were added, and the incubation continued 15 
minutes more. The beads were separated from the solution 
magnetically, similarly washed, and the DNA eluted from 
the beads in 1% SDS at 65° C. The eluted DNA was ethanol 
precipitated and fractionated by electrophoresis in an agar- 
ose gel, which was dried and autoradiographed. 

Only the fragment containing the EcRE oligonucleotide 
was specifically and efficiently registered on the 
autoradiographs, and that registration was dependent upon 
the anti-EcR antibody. Quantitative analysis of the autora- 
diographs demonstrated a 10?-fold preference for binding to 
the EcRE oligonucleotide over the average vector 
sequences, under the conditions of this assay (see Methods, 
below). 

According to the criteria stated at the beginning of this 
Experimental Example, the EcR protein clearly satisfies the 
definition of an ecdysone receptor. 

Methods 

Conditions for the DNA Binding Assay 

A quantity of 0.2 fmole of digested, labelled plasmid 
DNA was mixed with 2 ug (dI/dC) in 10 ul of TE (10 mM 
Tris HCl, pH 8.0, 1 mM EDTA), and 90 ul of the MtEcRHy 
extract, diluted to 0.9 mg/ml in EcR buffer adjusted to 180 
mM KCI, was added. After binding for 15 minutes at 25? C., 
2 ml of affinity-purified anti-EcR antibody, diluted 1.5x in 
EcR, was added, and this incubation was continued at 25? C. 
for 40 minutes, when 50 ul of anti-rabbit Ig-coated magnetic 
beads (Dupont Magnasort-R), exchanged into 180 mM KCI 
EcR buffer, was added and the incubation continued for 15 
minutes. 

The beads were washed twice in 400 ul 180 mM KCI EcR 
buffer, and DNA was eluted from the beads by soaking twice 
in 200 ul 1% SDS in TE at 65? C. The eluted DNA was 
ethanol precipitated and run on an agarose gel, which was 
dried and autoradiographed. As controls, one half of the 
input DNA (0.1 fmole) was run on the gel for comparison, 
and the binding assay was carried out leaving out the 
antibody. 


EXAMPLE IV 
RECEPTOR GENE MUTAGENESIS 


Mutations in the steroid receptor superfamily genes can 
alter their function in two ways. Most obviously, they alter 
the sequences encoding the receptor proteins and thus alter 
the receptor function. Alternatively, they can alter the 
expression of these genes—an alteration that can be at any 
level of that expression from transcription of the gene to the 
translation of its mRNA(s). Such mutations can change 
when the gene is expressed during development or change 
the tissue and cell distribution of that expression. Thus, they 
can profoundly change the course of development. 
Furthermore, these mutations provide information about the 
regulation of receptor gene expression, just as mutations that 
alter the structure of the receptors encoded by these genes 
provide information about the genes whose expression these 
receptor proteins control. In particular, mutations that alter 
receptor gene expression can lead to the identification of the 
proteins and other regulatory molecules that control that 
expression. Clearly, mutagenesis of insect steroid receptor 
superfamily genes provides an important avenue leading to 
an ability to interfere in a high specific manner with insect 
development and thus to control insect infestations delete- 
rious to human health and agriculture. 

We have carried out mutagenesis experiments for two 
Drosophila members of the steroid receptor superfamily 
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genes, E75 and E74, that we have cloned and characterized 
with respect to their expression. In this experimental 
example, mutagenesis of the E75 gene is described. 

A. Deletion Mutations 

In Drosophila, genetic analysis for a given locus—in this 
case, the early puff locus at 75B that houses the E75 
gene—generally depends upon the isolation of deletions of 
all or part of that locus. This is because such deletions 
greatly facilitate the subsequent isolation of point and other 
small mutations within the locus. By isolating mutations that 
are revertants to the neighboring dominant Wrinkled (W) 
mutations, we have isolated and molecularly mapped the 
boundaries within our chromosomal walk (see Experimental 
Example I) of two deletions, W** and WÂ19, generated by 
gamma ray mutagenesis, the preferred way of generating 
such large alterations of genomic structure. One of these, 
W#10 extends distally from Wrinkled to cover the entire 
E75 gene; and the other, W^", extends to a point about 90 kb 
upstream of the 5' end of the 50 kb E75A transcription unit 
and does not include the E75 gene. 

An F2 screen was then employed to screen for gamma 
ray-induced mutations mapping to the 200 kb distal region 
that is included in the W^? deletion but not the W^^ 
deletion. This screen resulted in the isolation of five mem- 
bers of a single lethal complementation group that molecular 
mapping data demonstrate represents the E75 gene. The 
most useful of these five mutations is the E75*** mutation. 
Molecular mapping of this mutation demonstrated that it is 
a 105 kb region that includes all of the E75 gene. It is useful 
because it provides an extremely efficient method to screen 
for other E75 mutations, i.e., by screening for mutations that 
cannot complement this deletion mutation. 

B. E75 Mutations Generated by Ethyl Methane Sulfonate 

The chemical mutagen ethyl methane sulfonate, or EMS, 
was used for this screen, as it is the preferred method for 
generating point or small mutations. An F2 screen of 15,000 
lines resulted in the isolation of 23 penetrant mutations 
within the 105 kb region of the E75*** deletion, all of which 
turned out to be alleles of E75. It appears that this 105 kb 
region was saturated by this screen in respect to lethal 
complementation groups, and hence, E75 appears to be the 
only lethal complementation group in this region. Adding 
the five E75 mutations described above, a total of 28 
penetrant E75 alleles have thus been isolated, several of 
which are temperature-sensitive alleles. 

Inter se complementation studies among these alleles and 
examination of their phenotypes reveal a complex comple- 
mentation group—a complexity that probably results from 
the fact that the E75 gene contains two overlapping tran- 
scription units, a 50 kb E75A unit and a 20 kb E75B unit that 
occupies the 3' end of the E75A unit (see Experimental 
Example I and Table 1). These alleles can be roughly divided 
into two groups: (1) those that cause lethality in early 
development—during the latter part of embryogenesis or 
during early larval development, and (2) those that cause 
lethality late in development during the prepupal or pupal 
stages. 

This division correlates with the stages when the E75A 
and E75B units are expressed. Thus, E75A transcription is 
associated with each of the six pulses of ecdysone, including 
those that mark the embryonic and early larval stages. By 
contrast, E75B mRNAs are not observed until the end of the 
last larval stage, being particularly abundant during the 
pupal stage. This correlation invites the speculation that the 
early lethal mutations affect the expression of the E75A unit 
and its E74A protein, and that the late lethal mutations 
specifically affect the expression of the E75B unit and its 
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E75B protein. This proposition can be tested by detailed 
molecular mapping of these mutations and further exami- 
nation of their phenotypes at the molecular level to deter- 
mine the causes of lethality. 

The mutants described here provide a foundation for the 
further genetic analysis of the E75 gene that will allow 
exploration of the requirements for appropriate E75 expres- 
sion and function and will identify structural and functional 
domains of E75. Some of the future E75 studies will best be 
performed by its in vitro manipulation, followed by trans- 
formation of the constructs back into Drosophila. Finally, it 
will be desirable to identify interacting genetic loci— 
interactions that may occur at the level of regulation of E75 
expression or at the level of interaction of the E75 proteins 
with those encoded by other genes. Such interactive genetic 
loci can be identified via the isolation of mutations that act 
as suppressors or enhancers of the E75 mutations. 

Methods 

Strains, Markers and Chromosomes 

For this aspect of the invention, the following strains, 
markers and chromosomes were used. Tu? was described by 
Lindsley (Lindsley, 1973. DIS 50:21). All other strains and 
mutations are as described (Lindsley, and Grell, 1968. 
Genetic Variation of Drosophila melanogaster, Publication 
627, Carnegie Institute of Washington, Washington, D.C.). 
ru h W** e? ro ca was constructed by recombination between 
ru h W^" sbd? Tu? and st sbd? e? ro ca. The st in ri p? sbd? 
chromosome was constructed by recombination of st in ri p” 
with sbd2, in order to allow marking of this chromosome 
over WA^ and W*?°, and homozygosed by crossing to TM3, 
backcrossing to TM3, and mating of isogeneic sibling prog- 
eny. The homozygosed st p” ell line was a kind gift of Ken 
Burtis. Matthew Scott provided Antp" and ns^^*. Allan 
Shearn provided the pupal lethals X19, q26, O13B, 8m12, 
iX-14, 2612, m45, p4, q30L, mz416, 13m115, 052 and 
wg49. All strains used to construct the strains described 
above and other strains were obtained from the Bowling 
Green and Caltech stock centers. 

TM1, TM3 and TM6B (Lindsley, and Grell, 1968. 
Genetic Variation of Drosophila melanogaster, Publication 
627, Carnegie Institute of Washington, Washington, D.C.) 
are balancer chromosomes carrying recessive lethal muta- 
tions along with multiple inversions to suppress recombi- 
nation. This allows the maintenance, as a heterozygote, of a 
recessive lethal chromosome in its original state. These 
chromosomes are also marked with convenient visible mark- 
ers. 

Quantitative Southern Blot Mapping for Detection of 
Mutant Lesions 

DNA was prepared from adult flies (about 50) by dounc- 
ing in 1 ml of 10 mM Tris HCl, pH 7.5, 60 mM NaCl, 10 mM 
EDTA, 0.15 mM spermine, 0.2 mg/ml proteinase K. The 
homogenate was added to an equal volume of 0.2 M Tris 
HC1, pH 9.0, 30 mM EDTA, 2% SDS, 0.2 mg/ml proteinase 
K, incubated at 37? C. for 1 hour, and then extracted twice 
with buffer-saturated phenol and once with 24:1 chloroform/ 
isoamyl alcohol. DNA was EtOH precipitated twice, hook- 
ing the pellet out without centrifugation. Southern blot 
hybridization was as described (Segraves, W. et al., 1984. J. 
Mol. Biol. 175:1-17). Where restriction fragment length 
polymorphism was not used in order to distinguish the 
parental chromosome from the balancer chromosome, quan- 
titation of band intensity on genomic Southerns was 
achieved using a scanning densitometer. By using a control 
probe outside the mutant region, the amount of DNA in each 
track was internally controlled. Comparison of deficiency 
heterozygote to wild type bands, when normalized to a 
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control band in this way, gives little deviation from the 
expected 1:2 ratio. 
Molecular Cloning of Mutant Lesions 

Restriction fragments of the appropriate size were isolated 
by preparative low melting agarose (FMC) electrophoresis 
of about 20 ug of restricted genomic DNA. The 6 kb W^* 
Xho I fragment was cloned into Xho I-cleaved ASE6ABam 
which is propagated as a plasmid in order to grow the vector 
and cannot be packaged without an insert. The 18 kb W^? 
Sall fragment was cloned into the Sall site of AEMBL3, 
cleaved also with EcoRI for the biochemical selection 
method of prevention of propagation of non-recombinant 
clones. The 7 kb EcoRI fragment containing the x37 break- 
point was cloned into EcoRI-cleaved 2.607. Plating of 
recombinants on the hflA strain RY1073 prevented plaque 
formation by non-recombinant phage. The 14 kb x48 EcoRI 
fragment was cloned into the EcoRI site of XEMBLA, which 
had been cleaved with BamHI to utilize the “biochemical 
selection" for recombinants. The breakpoint fragments of 
x44 and the recipient fragment were cloned into ASEGABam. 
Libraries were packaged using À in vitro packaging extracts 
prepared as described in Hohn (Hohn,  ., 1979. Methods 
Enzymol. 68:299—303). After demonstration that each of the 
libraries gave a significant number of plaques only when 
inserts were included in the ligation, they were screened 
using restriction fragments capable of detecting the break- 
point clones. 

Gamma Ray Mutagenesis 

Adult males of the strain ru h W sbd? Tu? or st in ri pP sbd? 
were irradiated in plastic vials with 5000 rad of gamma rays 
from a Cs? source at a dose rate of 4300 rad/minute. These 
were then mated to virgins of the appropriate strain which 
were allowed to lay eggs for five days. 

EMS Mutagenesis 

The primary lesion in EMS-induced mutations of bacteria 
and yeast is an alkylation-induced transition of guanine to 
adenine; most EMS-induced point mutations in Drosophila 
can similarly be explained on this basis. This change would 
be expected to convert, on the complementary strand, a C in 
the opa repeat element to a T, creating an in-frame stop 
codon (CAGCAA to UAGCAA or CAGUAA). 
(Ethylnitrosourea, ENU, which has been reported to yield a 
higher number of mutations for a given amount of sterility, 
is also an alkylator; however, considerably more stringent 
precautions must be taken in handling this mutagen.) 

EMS was administered at 0.025 M to unstarved 1.5—5 
day-old males in 1% sucrose solution (1.5 ml on two slips 
of Whatman #2 in a 350 ml milk bottle). Starvation of the 
males for 8 hours before EMS administration resulted in 
unacceptable levels of sterility, and males of the st p? e! 
strain readily fed upon the EMS/sucrose solution without 
starvation. Mutagenesis was monitored by crossing 
mutagenized males to attached-X FMA3 females. Other 
mutants seen in this screen included a large number of ca 
alleles (many mosaic) seen over TM6B in the F1 and F2 
generations, a dominant brown allele, and two new mutants, 
Wink, a third chromosome dominant mutation resembling 
Bar, and a third chromosome dominant Curly-like mutation. 
Wink is easily scored (RK1), has complete penetrance, and 
is quite healthy over TM6B. 

In the initial screen, vials were scored as mutant if they 
had fewer than 2596 as many deficiency heterozygote as 
balancer heterozygote flies. On retesting, this was revised to 
50% of the level seen in control crosses. Balancer heterozy- 
gotes were approximately two thirds as viable as deficiency 
heterozygotes. 
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In Situ Hybridization and Cytological Analysis 

In situ hybridization of polytene chromosomes was car- 
ried out as described in Experimental Example I (see 
Methods, section A). Cytological analysis was performed by 
squashing larval salivary glands in lactoacetic orcein (2% 
orcein, 50% acetic acid, 30% lactic acid). 

Although the present invention has been described in 
some detail by way of illustration and example for purposes 
of clarity of understanding, it will be obvious that certain 
changes and modifications may be practiced within the 
scope of the claims. 

What is claimed is: 

1. An isolated, recombinant polynucleotide segment 
encoding an insect ecdysone receptor, 

wherein the ecdysone receptor binds both to an ecdysone 

response element and to ecdysone; wherein the seg- 

ment comprises: 

(1) a nucleotide sequence encoding a steroid hormone 
receptor zinc finger DNA binding domain that binds 
an ecdysone response element, wherein the DNA 
binding domain comprises a plurality of cysteine 
residues, wherein the nucleotide sequence encoding 
the DNA binding domain hybridizes under selective 
hybridization conditions to nucleotides 1858 to 2055 
of the DNA binding domain of Drosophila EcR of 
Table 2; and 

(2) a nucleotide sequence encoding a hormone binding 
domain that binds ecdysone, wherein the nucleotide 
sequence encoding the hormone binding domain 
hybridizes under selective hybridization conditions 
to nucleotides 2359 to 3021 of the ecdysone binding 
domain of Drosophila EcR of Table 2; 

wherein the selective hybridization conditions com- 
prise a hybridization step in less than 500 mM salt 
and at least 37? C., and washing in 2xSSPE at 63? C. 

2. The isolated recombinant polynucleotide segment of 
claim 1 wherein the selective hybridization conditions com- 
prise less than 200 mM salt and at least 37? C. for the 
hybridization step. 

3. The isolated recombinant polynucleotide segment of 
claim 1 wherein the selective hybridization conditions com- 
prise 2xSSPE and 63? C. for both hybridization and wash 
steps. 

4. An expression vector comprising the isolated recom- 
binant polynucleotide segment of claim 1, wherein the 
isolated recombinant polynucleotide segment is operably 
linked to transcription regulatory elements. 

5. An isolated host cell transfected with the expression 
vector of claim 4. 

6. The isolated host cell of claim 5 which is a plant cell. 

7. The isolated host cell of claim 5 which is a bacterium. 

$8. The isolated host cell of claim 5 which is an insect cell. 

9. The isolated host cell of claim 5 which is a mammalian 
cell. 

10. A method for regulating expression of a polypeptide 
in a host cell that lacks an ecdysone receptor and that is 
insensitive to ecdysone, the method comprising the steps of: 

(I) transfecting the host cell with a first and second 

expression vector wherein: 

(A) the first expression vector is the expression vector 
of claim 4; and 

(B) the second expression vector comprises second 
transcription regulatory sequences operative in the 
transfected host cell, wherein the second sequences 
comprise the ecdysone response element, wherein 
the second sequences are operably linked to a nucle- 
otide sequence encoding the polypeptide for which 
regulation of expression is desired; 
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wherein the ecdysone receptor is produced in the 
transfected host cell by expression of the isolated, 
recombinant polynucleotide segment of the first 
expression vector; and 

(II) exposing the transfected host cell to ecdysone, 

whereupon the transfected host cell transcribes the nucle- 

otide sequence encoding the polypeptide and expresses 
the polypeptide, thereby regulating the expression of 
the polypeptide in the transfected host cell. 

11. The method of claim 10 wherein the selective hybrid- 
ization conditions comprise less than 200 mM salt and at 
least 37? C. for the hybridization step. 

12. The method of claim 10 wherein the selective hybrid- 
ization conditions comprise 2xSSPE and 63? C. for both 
hybridization and wash steps. 

13. The method of claim 10 wherein the recombinant 
polynucleotide segment encodes the insect ecdysone recep- 
tor comprising the amino acid sequence of Drosophila EcR 
set forth in Table 2. 

14. The method of claim 10 wherein the host cell is a 
bacterium. 

15. The method of claim 10 wherein the host cell is an 
insect cell. 

16. The method of claim 10 wherein the host cell is a 
mammalian cell. 

17. The method of claim 10 wherein the host cell is a plant 
cell. 

18. A recombinant polynucleotide comprising a nucleic 
acid segment encoding a fusion protein, wherein the fusion 
protein comprises: 

(1) a first nucleotide sequence encoding a DNA binding 

domain; and 

(2) a second nucleotide sequence encoding a hormone 

binding domain that binds ecdysone, wherein the sec- 

ond nucleotide sequence hybridizes under selective 

hybridization conditions to nucleotides 2359—3021 of 

the ecdysone binding domain of Drosophila EcR of 

Table 2; 

wherein the selective hybridization conditions com- 
prise a hybridization step in less than 500 mM salt 
and at least 37? C., and a washing step in 2xSSPE at 
63? C., and wherein the DNA binding domain and 
the hormone binding domain are from different ste- 
roid receptor superfamily members. 

19. The recombinant polynucleotide of claim 18 wherein 
the selective hybridization conditions comprise less than 200 
mM salt and at least 37? C. for the hybridization step. 

20. The recombinant polynucleotide of claim 18 wherein 
the selective hybridization conditions comprise 2xSSPE and 
63? C. for both hybridization and wash steps. 

21. The recombinant polynucleotide of claim 18 wherein 
the hormone binding domain encoded by the second nucle- 
otide sequence comprises amino acids 431—651 of Table 2. 

22. The recombinant polynucleotide of claim 18 wherein 
the second nucleotide sequence comprises nucleotides 
2359-3021 of Table 2. 

23. The recombinant polynucleotide of claim 18 wherein 
the DNA binding domain is the DNA binding domain of an 
insect ecdysone receptor. 

24. An expression vector comprising the recombinant 
polynucleotide of claim 18 operably linked to transcription 
regulatory elements. 

25. A host cell transfected with the expression vector of 
claim 24. 

26. The host cell of claim 25 which is a plant cell. 

27. The host cell of claim 25 which is a bacterium. 

28. The host cell of claim 25 which is an insect cell. 
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29. The host cell of claim 25 which is a mammalian cell. 
30. A method for regulating expression of a polypeptide 
in a host cell that lacks an ecdysone receptor and that is 
insensitive to ecdysone, the method comprising the steps of: 


(D transfecting the host cell with a first and a second 
expression vector wherein: 

(A) the first expression vector is the expression vector 
of claim 24; 

(B) the second expression vector comprises second 
transcription regulatory sequences operative in the 
transfected host cell, wherein the second sequences 
comprise the ecdysone response element which 
binds the DNA binding domain of the fusion protein 
encoded by the first expression vector, wherein the 
second sequences are operably linked to a nucleotide 
sequence encoding the polypeptide for which regu- 
lation of expression is desired; 

wherein the fusion protein is produced in the trans- 
fected host cell by expression of the isolated, recom- 
binant polynucleotide segment of the first expression 
vector; and 


(II) exposing the transfected host cell to ecdysone, 


whereupon the transfected host cell transcribes the nucle- 
otide sequence encoding the polypeptide and expresses 
the polypeptide, thereby regulating the expression of 
the polypeptide in the transfected host cell. 
31. The method of claim 30 wherein the host cell is a 
bacterium. 
32. The method of claim 30 wherein the host cell is an 
insect cell. 
33. The method of claim 30 wherein the host cell is a 
mammalian cell. 
34. The method of claim 30 wherein the host cell is a plant 
cell. 
35. An isolated, recombinant polynucleotide segment 
encoding a hormone binding domain of an insect steroid 
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receptor, wherein the hormone binding domain binds 
ecdysone, wherein the polynucleotide segment encoding the 
hormone binding domain hybridizes to nucleotides 2359 to 
3021 of the ecdysone binding domain of Drosophila EcR of 
Table 2 under hybridization conditions comprising a hybrid- 
ization step in less than 500 mM salt and at least 37? C., and 
a washing step in 2xSSPE at 63? C. 

36. The recombinant polynucleotide of claim 35 wherein 
the hybridization conditions comprise less than 200 mM salt 
and at least 37? C. for the hybridization step. 

37. The recombinant polynucleotide of claim 35 wherein 
the hybridization conditions comprise 2xSSPE and 63? C. 
for both hybridization and wash step. 

38. An expression vector comprising the recombinant 
polynucleotide of claim 35 operably linked to transcription 
regulatory elements. 

39. A host cell comprising the expression vector of claim 
35, wherein the transcription regulatory elements are opera- 
tive in the cell. 

40. The host cell of claim 39 which is a plant cell. 

41. The host cell of claim 39 which is a bacterium. 

42. The host cell of claim 39 which is an insect cell. 

43. The host cell of claim 39 which is a mammalian cell. 

44. An isolated, recombinant polynucleotide segment 
encoding a hormone binding domain of an insect ecdysone 
receptor, wherein the hormone binding domain exhibits 
saturable binding to ecdysone of an ecdysteroid, wherein the 
polynucleotide segment encoding the hormone binding 
domain hybidizes to nucleotides 2359 to 3021 of the ecdys- 
one binding domain of Drosophila EcR of Table 2 under 
hybridization conditions comprising a hybridization step in 
less than 500 mM salt and at least 37? C., and a washing step 
in 2xSSPE at 63? C. 

45. The polynucleotide of claim 44, wherein the ecdys- 
teroid is ponasterone A. 
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